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MAXIMUM VALIDITY OF A TEST WITH EQUIVALENT ITEMS. 
; LEDYARD R_ TUCKER 


COLLEGE ENTRANCE EXAMINATION BOARD 


It is assumed that a scale of true scores on a function exists and 
that the probability of answering an item correctly is a curve of 
the type of the integral of the normal curve. The product moment 
correlation between the test score and true score is derived for a 
normal distribution of subjects and a test ‘composed of equivalent 
items. Numerical examples demonstrate that the maximum correla- 
tion between ‘test scores and true scores occurs for a one hundred 
er a when the point correlation between items is less than three 

nths. 


One of the most common assumptions in the theory of mental 
tests is the existence of a continuum of true scores which, with errors 
of measurement, produce the observed scores on a test. The errors 
of measurement are assumed to be unrelated not only to other errors’ 
of measurement, but also to the true scores; that is, for a particular 
true score, the errors have the same mean and standard deviation as 
for any other true score. Many of the formulas and concepts in the 
theory of mental tests are based on these two assumptions. Mental 
testers have used test reliabilities and corrections for attenuation in 
their thinking about test results for so long that these two concepts 
have become almost second nature to them. A commonly stated propo- 
sition is that any increase in the reliability of a test will increase the 
test’s coefficients of validity. Since validity coefficients are difficult 
to obtain and work with, an intermediate goal of high test reliability 
has been established. New and improved types of items have been 
devised, more advantageous distributions of item difficulties have 
been investigated, and tests have been increased in length in order to 
increase the reliabilities of the tests. Although this work has resulted 
in better tests as indicated by increases in validity coefficients, criti- 
cal consideration of several theoretical possibilities reveals inconsis- 
tencies between higher reliability and better measurement along the 
assumed continuum, and thus possible lower coefficients of validity. 

Consider the case when all the items in a test are equivalent; 
that is, when the items all measure the same trait, have equal reli-_ 
abilities, and are equally difficult. In this case the items are equally 
intercorrelated with coefficients equal to the item reliabilities. It is 
common experience that such a test with items of usual reliability 
administered to a random population produces a distribution of scores 
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which approaches a normal curve. If the reliability of the items were 
increased to unity, all correlations between the items would also be- 
come unity and a person passing one item would pass all items and 
another failing one item would fail all items. Thus the only possible 
scores’ are a perfect score or one of zero. Now the question can be 
raised whether the “measurement” obtained with the first test was 
due to the unreliability of the items or the results were solely due to 
a single imperfect discrimination. Is a dichotomy of scores the best 
that can be expected from a test with items of equal difficulty? 

A resolution of these questions can be obtained when it is as- 
sumed that individuals with low true scores on the continuum of abil- 
ity are less likely to answer each item correctly and individuals with 
high true scores are more likely to answer each item correctly. Figure 
1 shows how the probability of success on an item might be related 
to the true scores on the ability. The curve drawn on this figure rep- 
resents an item that produces only a moderately good discrimination 
since the curve extends over a considerable range of ability scores. 
An item that gives a perfect discrimination would be represented by 
a vertical line located at such a score on the ability that all individuals 
with lower scores fail the item and all individuals with higher scores 
pass it. A completely invalid item would be represented by a hori- 
zontal line, for in this case the probability of passing an item is in- 
dependent of an individual’s score on the ability. Thus, if a single 
item is to be used, one represented by a vertical line would be pre- 
ferred, since it would give perfect discrimination. 

Consider the case, however, where a large number of equivalent 
items are to be used. In this case all of the items would be repre- 
sented by the same curve. If the probability of correct response curve 
for these items were a vertical line, the total score over all items would 
accomplish nothing more than the perfect discrimination of any sin- 
gle item. Let the curve take a form such as illustrated in Figure 1, 
then a measure of the probability of success on an item could be trans- 
lated into differential scores on the ability. If the total score on a 
test made up of a large number of such items is divided by the num- 
ber of items, a measure of this probability of correct response for an 
item is obtained. The total score then produces a valid rank order 
of the individuals and a test with imperfect items gives better meas- 
urement of the scale of ability than a test with perfect items. 

The foregoing proposition can be stated more precisely in mathe- 
matical terms. Item curves like the one in Figure 1 can be thought 
of, in the simplest case, as depending on two parameters, one for the 
general level of difficulty and another for the discriminative power 
of the item. A convenient index of an item’s difficulty is the score on 
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the scale of ability where the probability of correct responses is one 
half and will be designated by s;. The parameter for an item’s dis- 
criminative power can be described in terms of a coefficient of the 
amount of spread of the item curve. This coefficient, which will be 
designated o; , has an inverse relation to the discriminative power of 
the item in that a large o; indicates a flatter curve than a small o; 
and thus a lower discriminative power. An item which gave a perfect 
discrimination would have a o; equal to zero. Equivalent items will 
have equal s,’s and o,’s. The correlation, r;,, of a test made of n of 
these items with the scale of ability is, for a given population, a func- 
tion of s;, o; , and 7 as stated below: 

c Tts = $(8;,0;,). (1) 
Since it is desired to investigate the relation of 7;, and’ o;, it can be 
assumed that s; and n are constants. This relation will be developed 
in the following section for the special case where the population has 
a normal] distribution on the scale of ability; but, since most popula- 
tions tested approximate such a distribution, the results should hold 
approximately for most situations. 

Following is a summary of the notation that will be used. 
j = a subscript indicating an item. 
k= a subscript for an alternative item. 
t= a subscript indicating the test. 
$s = both the subscript for the scale of ability anda score on 
the scale of ability. 
pj, — the probability of a correct response to item 7 by an in- 
dividual with ability score s. 
$; =the s where pjs is one half. 
o;—a constant of the pjs, s curve which determines the 
spread of this curve. 
N = number of individuals within the population tested. 
f. = the frequency of individuals with ability scores of s with- 
in the population tested. 
fis =the frequency of individuals with ability score s who 
pass item 7. 
p; = the proportion of the individuals in the population who 
give the correct response to item j. 
fixs = the frequency of individuals with ability score s who 
pass both items 7 and k. 
pj. = the vroportion of the individuals in the population who 
pass both items 7 and k. 
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Tis = the product moment correlation coefficient between item 
mK and the scale of ability s. _ 

ra the product: moment correlation coefficient between ited 

. nud and item k. 

Tt mathe product. moment correlation coefficient between ie 
..¢ and the scale of ability s . 

omszthe number of items in the test. 


Several arbitrary ‘variables will be introduced during the mathemati- 
cal development and their designation will be explained in the con- 
text. 

In order to give substance to the development it is necessary to 
assume an equation for the curve of Figure 1, and,-since its general. 


‘ “ 
i ° oF op 
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FIGURE 1 


shape is.the same as the integral of the normal curve, the assumption 
is made that it can be represented by the following equation: 


u j= (8-85) /O4 
1 
Dig = — ee? du,. (2) 
sears V2a° 


j=- 





The arbitrary variable u; is used in order not to confuse this integra- 
tion with other operations on the s scale. 

A second necessity is to have a population to work with; there- 
fore, it is assumed that the distribution of scores on the s scale is 
normal with a unit standard deviation and a mean of zero. 

N 


f,=-——e™. (3) 
V2a 
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The first task of the development is to derive the equation for 
the frequency of individuals with score s who pass item j. Using the 
probability of correct response p;, as the proportion of the individuals 
with score s who pass the item, the following equation can be writ- 


ten: 


fis=Sfe Dis. (4) 
Substituting from equations (2) and (3), equation (4) becomes: 
uj=(S-85)/O . : 
fis sock oi oth. eHstu) day. ee (5) 
- 2a Uj=-0 r 


This equation for f;, is difficult to work with since s occurs as one of 
the limits of integration. This difficulty is eliminated by the following 
transformation of variables, s being held constant: 





Then: 
ma 1 
v; = (8 — o; us) ———. (6) 
V1 +o; 
1 ae 
uj =— [s—v; V1 + o;7], (7) 
oj 
) ee 
du; = v1 7 o;? dv;. (8) 


oj 


When this transformation is performed, equation (5) becomes: 


N T+o* vj=84/V14 Oj? : 
ee | e+ dy;, (9). 


226; 


y= 00 


where 








1+ o;? $V; 
A ot fs? + vjt—2—_—- ). 
2 V1i+o; 


Cj 
It is now advantageous to define: 
- | 
Figg SE teense, (10) 
Ln Mite etn Sin ren 
When equation ( 10) i is ‘substituted in. (9) ‘and. the limits of ‘integra: 


tion are interchanged accompanied with a change in algebraic sign, 
the equation for f;, becomes: 
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fresno  e8 do;, (11) 
2 a v1 ie a 


9=Tvs8j 





where 


1 
ee 5 (8? + 07? —28 0; Tr). 
— Tvs 





This equation gives the interesting fact that the variables v; and 
s have a normal surface of frequency with a correlation 7,,. The dis- 
tribution on v; is also a unit normal curve with the lower limit of in- 
tegration being the value of 7; which is predicted from s; by the re- 
gression of v; ons. 

The proportion of individuals in the population who pass item 
j can be obtained by integrating f;, for all values of s as follows: 


: 1 8=00 
pj = v J Ae ds . (12) 


When the right side of equation (11) is substituted for f;, in equa- 


tion (12), 
1 Vj=00 8=00 
j= 4 i es dv; ds. (13) 
22a V1—fe* a a 


This equation integrates to: 


vt g=00 1 
Dj = mame 6-17" dy, . (14) 
Vj=85Tp v2 7 


The proportion passing an item can then be found from equations 
(10) and (14) for any values of o; and s;. 

The next problem to be considered is the product moment corre- 
lation of an item with the scale of ability. Let the scores on an item 
be designated by z;. These scores are either 0 or 1. Equation (15) 
gives the formula for the correlation of item j with ability s: 


> 2;8 
%,=————.. 
N V Dj 9; 
The mean and o of the s’s do not enter in this formula since they are 


zero and unity respectively. Let the s’s for individuals with unit 
scores on the item be designated s,; then: 


2r%ys= Zs. (16) 








(15) 
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The individuals with x,’s of zero do not contribute to this sum. When 
this summation is expressed in integral form it becomes: 


3s,= fi fusas. (17) 


Substitution for f;, its value in equation (11) this becomes: 


>= 5d f ay ia se+8 ds dv; , (18) 
2 7 v1 me 5,4 Uj=8jTve $=-00 


which integrates to: 











N 115 
a... (19) 
1 
Define: 
1 
Wesel a ve (20) 
7 


Then substituting in equation (15): 


Ys, Tes) 





Vis = Tvs 


(21) 





| VD; i 
This relation is not unexpected since 7,, is the coefficient of the nor- 
mal surface between v; and s and is therefore the biserial coefficient 


of correlation. 7;, is the point biserial coefficient. Equation (21) is 
the formula relating these two coefficients. 

As a preliminary to deriving the formula for the intercorrelation 
between two items, j and k, it is necessary to find the equation for 
the proportion of the individuals who pass both items. Since the two 
items are assumed to be related only by the single ability s , the prob- 
ability, p;x., of individuals with the same ability passing both items 
can be written by the law of joint probability: 


Diks — Dis Drs} (22) 
and: : 
fine =f Dis Dus « (23) 
Substituting equations (2) and (3) into equation (23): 

Uj= (8-8 j)/0; Ue= (3-84) /O% 
fixs = sale ( f ent (uj eunt+s*) dy du, . (24) 


(2 7) at Uj-00 


k=-0O 
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The transformation of variables of equation (6), (7), and (8) is 
used to remove s from the limits of integration. Equation (24) then 


becomes: 











2 2 Vi=eo V_=CO 
PROT ok Ati te ll f edy,dy,, (25) 
(2 n) 3/2 Oj OK Vj=85T ve Un=SaToe 
where 
3? 1 +o? 1+o;? 8 
nS AS tind IO 5 ok Al 
o;? o;” oj? oi? 
1+ o? I+ oF 
(OS ce cS 
oi ox 
Since the items are equivalent 
ojo, (26) 
$j = Sx 5; (27) 


Tr, Of equation (10) will be used for both items: Equation (25) then 
becomes: 


N 1 j=00 V_=00 
 —— f f edv,dy,, (28 
5; (2 n) 8/2 (1 ts Fas") Cj=T 08 j Un=Toe8j , ( ) 


1 + t.,* 1 Tes 
D= + (v;? + v7?) —2 
1-— Tos* i~— Tus? eos, Tos? 
In order to find the proportion, p;, , who pass both items, fix. is inte- 
grated over all values of s: 





where 





8(v; + UK) - 








= 00 1 
m= fo wie ds. (29) 
Substituting the value of fj, in equation (28) into equation (29) 
gives: 
: f ee ine f ""e? dv; dv, ds. (30) 
= e? dv; dv ; 
1 (2 n)** (1 — F5,") &=-00 Vj=To08j Vr=T 0085 ’ ; 


The terms involving s can be collected, the square completed, and the 
integration over s performed so as to obtain: 


1 1 4=00 De=00 
Dix — eik dv; dv, , (31) 
22Vi-—-#F f 


Ose 





J=T0e8j  Ve=T 0984 
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where 
1 
 —— > (v;? +vu,Z2—2 VU; VE on. > 
UjivE 

and 

i Ka (32) 
or 

1 
(33) 





r = . 
mm 1 +? 


As in the case of the relation v; and s the bivariate distribution be- 
tween v; and v; is a normal surface, but in this case cut by two planes, 
one parallel to each of the axes. The proportions in the four quad- 
rants resulting from this double division by the two planes are the 
entries in the four-fold correlation table which has r vm 2S its tetra- 
choric coefficient. A further point of interest about r as is that when 


the effect of the ability s is partialled out, the coefficient of partial 
correlation ite is zero: 


2 


Tvs 
Vive 
r = =0. (34) 
U5UE.8 V1 cnitehes Ie Vil ie os" 








When the product moment coefficient of correlation between the two 
items is found, as indicated in the following section, the partial cor- 
relation found from it is not zero. 

Equation (35) is the formula for the product moment correla- 


tion between item 7 and k: 
Dik — Dj Pr (35) 
VP; 9; VPK 
Since the items are equivalent, 
Dj = De (36) 





Ti. = 





and 
a, am OS 
— Pay (37) 
DP; 9; 
The next problem is to obtain an equation relating the scores on 


a test composed of n items and the ability s. The formula for the 
correlation between the test scores, x;, and scores on ability s is: 





b> n20i (38) 


ts = 
N Crt 
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The variance of the scores on the test can be found by equation (39): 
o° = [n + n(n— 1) rx) D; 9; (39) 


since the variance of the individual items is p; q; and they are equally 
intercorrelated. The numerator of equation (38) can be written as: 


2 t18=Z(TV4Xj)s (40) 
which becomes: : 
Tes=LVIzjs, (41) 
or by equation (15) and the fact that the items are equivalent: 
DD 2:8=N rip VD; 4G; - (42) 
When equation (39) and (42) are substituted in (38) 


NT js 
(43) 





Tits oo 





Vn+n(n— Dre 


It is now possible to start with given values of s; , o; and n compute 
the correlation of the test with the ability. The important computa- 
tional steps are to find the values of: 


1) 7s from equation (10), 
2) p; from equation (14), 
3) rj. from equation (21), 


4) r »,trom equation (32), 


1 


5) py from equation (31), 
6) 7; from equation (37), 
7) 7 from equation (43). 


Figure 2 shows the relation of the correlation, 7;,, of the scores 
on the test with those on the ability and the major dependent vari- 
able o; when s; is zero and the test is composed of 1 item, 10 items, 
or 100 items. Only in the case when the test has a single item does 
1:, have its highest value when o; is zero, in each of the other cases 
the maximum 7,, occurs for a finite value of o; greater than zero. 
When a test is composed of more than one item its validity increases 
to a maximum value as o; increases from zero and then falls off again 
for less diagnostic items. Figure 3 shows that this same relation holds 
for tests whose items are not pitched at the mean of the population. 
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For illustrative purposes a value of s; equal to unity was chosen. Simi- 
lar curves could be drawn for items of other difficulties. 

The relation indicated above is given in more conventional form 
in Figures 4 and 5 where the correlation of the test with the ability 
is plotted against the intercorrelations of the items. If the scale of 
ability is taken as the criterion, 7;, is the coefficient of validity. The 
item intercorrelation, 7;,, can be interpreted as the reliability coef- 
ficient of the items. Figure 4 shows that a one hundred item test, per- 
fectly pitched in difficulty for the population tested, has a validity of 
slightly less than .8 for items with perfect reliability, but attains a 
maximum validity over .97 when the items have a reliability slightly 
over .2. 

A solution for the o; which would give a maximum r,;, for fixed 
values of s; and « would be of great interest but cannot be obtained in 
the general case since the equations (14) and (31) cannot be inte- 
grated except in special cases. The results illustrated in Figures 2-5, 
however, substantiates the conclusion that, for tests composed of items 
of equal difficulty, best measurement is obtained when these items have 
less than perfect discriminative power and reliability. 

A result which seemed amazing was the low values of the item 
reliabilities which yielded the best measurement. The item reliabil- 
ities for maximum test vs. ability correlations were within the range 
of practical experience. This fact would not, however, necessarily 
indicate that item reliabilities should be held down to the values indi- 
cated in the Figures. Since the curves rise sharply for low item reli- 
abilities and fall slowly for items with higher reliabilities, and since 
item reliabilities are unstable, it is safer for the reliabilities to be too 
high. 

Although the preceding development is of interest in pointing out 
an inconsistency between high test reliability and good measurement, 
most tests are constructed with a distribution of item difficulties; 
therefore, the assumption of equivalent items does not hold and the 
results are not directly applicable. It is hoped that an extension of 
the present system will indicate the validity properties of a test when 
the items differ in difficulty and that a system can be developed to 
determine the best distribution of item difficulties. 
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FIGURE 2 





FIGURE 3 
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FACTOR ANALYSIS AND BODY TYPES 


L. L. THURSTONE 
THE UNIVERSITY OF CHICAGO 


A factorial analysis was made of a small battery of twelve an- 
thropometric measurements. The correlations can be accounted for 
by four factors in a simple structure. This small battery has been 
used by the author for teaching purposes. Several of these factors 
seem to be meaningful, but their acceptance must depend on more 
comprehensive studies of body measurements with a larger number 
of measurements. 


In a recent paper* W. H. Hammond of University College, Lon- 
don, described a factorial analysis of twelve body measurements on 
adult men. After computing all of the intercorrelations, he factored 
the correlation matrix by a method of Cyril Burt. Hammond at- 
tempted to interpret the resulting factor matrix directly and with- 
out rotation of axes. It is the purpose of this paper to present an 
alternative factor analysis of the same correlation matrix which re- 
veals an interesting simple structure with rotation of axes. The writ- 
er has used Hammond’s data for teaching purposes because it is un- 
usual for a small battery of twelve variables to give so clear a simple 
structure with rather easily interpreted primary factors and with 
rank as low as 4. Because of the relative simplicity of this alterna- 
tive factorial interpretation it may help to clarify the nature and pos- 
sibilities of the simple structure concept. 

In dealing with the body type problem by factorial methods we 
did not consider types to be in the nature of discrete classes by which 
each individual may be categorized. It is doubtful whether students 
of the body type problem really believe in discrete categories without 
overlap. It seems more profitable to deal with the body type problem 
with a relatively small number of factors or coordinates in terms of 
which each individual physique can be described. If a person belongs 
to a type his physique would be conspicuous in one of the descriptive 
coordinates and not in the others. In such a coordinate system each 
individual might be described as a point in a descriptive space of as 
many dimensions as there are parameters, factors, or types, which- 
ever they might be called. It is then a question of fact whether indi- 


* Hammond, W. H. An application of Burt’s multiple general factor analy- 
sis to the delineation of physical types. Man, 1942, 42, 4-11. 
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viduals exist who would be described by points on the coordinate axes 
or in the coordinate planes. For some factors this might be physically 
possible whereas for others it might not be possible. 

In Table 1 we have reproduced Hammond’s correlation matrix 
for the twelve anthropometric measurements. It was factored by one 
of the centroid methods which produced the orthogonal factor matrix 
F, in Table 2. The absolute mean value of the fourth-factor residuals 
was .0311. Because the residuals were vanishingly small it was in- 
ferred that four factors were adequate to account for the variance 
in the given correlation matrix. 

The numerical values in the factor matrix F, of Table 2 are 
determined by the method of factoring that happens to be used. The 
particular factoring method is here of secondary consequence pro- 
vided that it produces a minimum distortion of the configuration in 
the common factors. Any legitimate factor method could be used be- 
cause the primary factors which are to be interpreted are determined 
by the test configuration and not by the factoring method. 

. Pairs of columns of F, were plotted and four successive adjust- 
ments were made in the rotation of axes. The transformation matrix 
A is shown in Table 3. This transformation matrix carries the given 
factor matrix F,, to the oblique factor matrix V of Table 4. We have 
then the matrix equation F,A = V. The matrix C = A’A is given in 
Table 3, and it shows the cosines of the angular separations between 
the reference factors. The configuration of the oblique factor matrix 
V is represented in the six diagrams of the accompanying figure. 
These six diagrams indicate no further adjustment in the four ref- 
erence axes A, B, C, D. The analysis of the final set of diagrams is 
the most important and interesting part of a factorial study. 

In the first diagram we have a plot of the projections of B against 
those of A. These are obtained directly from the columns A and B of 
Table 4. The two lines of this diagram represent the unit reference 
vectors A and B. All of these diagrams are drawn in orthogonal form 
for convenience rather than in oblique form because the structure is 
unaffected by this procedure except when the separating angles be- 
come quite small. The diagram BA shows three measures with sig- 
nificant saturation on the reference axis A. These are measures 8, 
9,10. In this study we have regarded saturations as significant if the 
factor loadings exceed .3 or .35, which mean that at least one-tenth 
of the variance of the measurement is attributable to the factor. Re- 
ferring to the list of anthropometric measures in Table 1 we find that 
these measures are head length, head breadth, and head height. The 
diagram BA shows that all of the other measures have vanishinelv 
small projections on the reference axis A. We find that there are only 
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three measures of head size and that all of them have significant sat- 
uration on A while all other measures have vanishingly small pro- | 
jections on this axis. This characteristic of the configuration leads/ +) 
us to the hypothesis that in the development of body proportions there 
exists a factor or parameter that is concerned with head size. 


In the same diagram BA we find that there are four measures 
with significant saturation on the reference axis B and that all of the 
other measures have vanishingly small projections on this axis. These 
four measures are stature, sitting height, span, and hand length. That 
which is common to these four measures seems to be bone length, and 
one is justified to formulate the hypothesis that in the development 
of body proportions some factor or parameter is primarily concerned 
with bone length. 

The diagram CA shows four measures with significant satura- 
tion on the reference axis C. These are measures 3, 4, 6, 7; namely, 
shoulder breadth, hip breadth, chest breadth, and chest depth. That 
which is common to these measures seems to be trunk size or girth. 
This characteristic of the configuration leads us to postulate a para- 
meter or factor that is primarily concerned with transverse meas- 
ures, which may be denoted girth. They are principally concerned 
with transverse measures of the trunk. 


The diagram DA shows four measures which have significant 
saturation on the reference axis D whereas all of the other measures 
have small projections on that axis. These are measures 1, 5, 11, 12; 
namely, stature, span, hand length, and hand breadth. These four 
measures include all of the measures of the extremities in this small 
battery of twelve variables. One might therefore postulate that there is 
a factor or parameter primarily concerned with size of extremities. 
The diagram DB is of special interest in our attempt to differentiate 
between the two factors D and B. In that diagram we see that the 
three variables 1, 5, 11 have significant saturations in both B and D 
but the situation is quite different with the measures 2 and 12; name- 
ly, sitting height and hand breadth. As far as this small battery of 
twelve variables is concerned, the differentiation between D and B 
hinges on the separation between sitting height and hand breadth. 
The factor B has a significant effect on sitting height but not on hand 
breadth. On the other hand the factor D has a significant effect on 
hand breadth but not on sitting height. 

The differentiation between these two factors, if they should be 
sustained in subsequent studies of larger batteries, would depend on 
the inclusion of additional measures which should represent a crucial 
separation between these two postulated factors. If several] rival hy- 
potheses should be formulated, then additional measures should be in- 
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cluded in a factorial study which would crucially differentiate between 
the several hypotheses in the hope that one of them might be sus- 
tained. That can be done only by analyzing a larger set of body meas- 
urements. 

In Table 5 we have the correlations between the four primary 
factors that correspond to the four reference axes. It will be seen 
that these four primary factors or parameters are essentially uncor- 
related, except B and C, which show a correlation of .38. These two 
primary factors have here been given the tentative interpretation 
length and girth, respectively. One might infer, therefore, that length 
and girth have a significant positive correlation, which is what we 
should probably expect. On the other hand, head size and the size of 
extremities seem to be two relatively independent factors. 

Table 6 shows the oblique factor pattern of this battery. In this 
table each significant factor loading of Table 4 is represented by an 
“zx” and the blank cells have vanishing saturations that can be ignored. 
This is an alternative representation of the simple structure which is 
useful for inspectional purposes. It shows at a glance, for example, 
that, as regards the common factors of this battery, stature can be 
represented as a linear combination of factors B and D. 

Summarizing this simple analysis, we have the following findings 
which should be regarded as tentative and subject to more intensive 
analysis on larger batteries. 


(1) The factors that determine body size and proportions 
seem to constitute a system that is simpler than the 
factors which determine the cognitive or intellective 
functions. We make this inference because similar fac- 
torial studies of the cognitive functions seldom reveal 
so clear a structure with only twelve tests and four 
factors. 


(2) The present battery seems to indicate the existence of 
four primary factors or parameters which are con- 
cerned with the development of: (a) head size, (b) 
bone length, (c) transverse measures or girth, (d) a 
factor less clear but which may be concerned with size 
of extremities. 


This paper will have served its main purpose if it should be pro- 
vocative of more intensive studies of larger sets of anthropometric 
measurements with the aid of the simple structure principle. There is 
good indication that it should be possible to identify those parameters 
or factors which operate to produce the individual variations in body 
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size and proportions. It is to be hoped that the factors that may be so 
identified will be directly related to the work of anatomists and endo- 
crinologists toward a scientific understanding of the mechanism of 


body growth. 


TABLE 1 
Correlation Matrix R 


D spa eoe wae tS. Oy FT 8 SI TE 








1. ew BL o.72 24 27 «81 21 06 25 04 .09 69 .41 
2. Sitting height ... .72 .JY .28 .28 51 29 11 16 02 16 .55 .15 
8. Shoulder breadth .24 .28 ™. 84 .41 48 .81 19 06 .06 30 . 
4. Hip breadth ...... 27 28 34 Ad ii 53 .47 26 04-04 35 .42 
eae 81 51 41 39 Of 33 12 22 15 11 .74 51 
6. Chest breadth ... .21 .29 48 58 33 §3% 36 .38 .13-02 .30 .37 
7. Chest depth ........ 06 11 31 47 12 36 Ar 10 04-03 .10 

8. Head length ........ 25 16 19 .26 .22 83 .10 32> 32 32 .22 .16 
9. Head breadth ... .04 02 .06 .04 .15 .18 .04 82 3S .85 .23 -01 
10. Head height ...... 09 16 06-04 .11 -02 -.03 .32 .35 2F. 4 08 
11. Hand length ...... 69 .55 80 85 .74 80 10 .22 .28 11 2% 68 


Hand breadth ... .41 .15 .28 .42 .51 .87 .20 16-01 .08 .68 4 


~ 
ie 











TABLE 2 
Orthogonal Factor Matrix F, 
I II II IV h? 
1, 72 j—565 —15 —11 8555 
2. 60 —34 —25 —.48 -1685 
3. 51 17 22 —.06 3410 
4, 59 21 386 —11 -5339 
5. 19 = =—.35 02 .08 7479 
6. .60 32 24 —.16 5456 : 
7. 34 34 29 —.26 3829 
8. ‘ 31 —.30 04 38902 
9. .28 29 —.36 32 3945 
10 .23 16 —.49 19 3547 
11 738 —.41 01 .23 8295 
12 58 —.15 88 31 5994 
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TABLE 38 
Transformation Matrix A C=A’A 
A B C D A B C D 
I 27 40 AT 46 A 1.00 —.14 00 —.07 
II 04 —.64 64 —.45 B —14 100 —38 —. 01 
iil —.70 —.42 58 382 C 00 —38 100 —.01 
IV 39 —50 —.17 -70 D —07 —01 —.01 1.00 
TABLE 4 
Oblique Factor Matrix V 
A B C D 
Re —.04 -76 —.08 45 
2. —.03 80 .00 01 
3. 05 .08 49 19 
4, —.02 01 64 22 
5. 02 52 15 55 
6. 10 01 65 10 
a: —.03 —.07 59 —.09 
8. 51 09 .23 .00 
9. 61 —.08 .05 11 
10. 57 10 —.11 01 
a4. 07 46 07 by 
12. —.07 01 84 .67 
TABLE 5 
Correlations of Primary Factors 
R,, = D(A’A)4 D 
T, Ts T. Ty 
T, 1.00 16 .06 07 
T, 16 1.00 38 03 
Te .06 388 1.00 .02 
- ha .07 .03 02 1.00 
TABLE 6 
Factor Pattern 
A B Cc D 
1. Stature x x 
2: Sittane NEIGH \.........-ccncsncseeee x 
3. Shoulder breadth .................. x 
a, Sin Wreea@th -.............:........... x 
Oe ee eee x x 
6. Chest breadth ....................-- x 
| ae ee eee x 
Se x 
So. tied Oreaatn. .................... x 
BU, ED SIPEG anno acacia oalk x 
Ee en x x 


12. Sand treadth =.:................. (x) x 
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SERIAL CORRELATION 
NATHAN JASPEN 
UNITED STATES EMPLOYMENT SERVICE 


Formulas are presented for triserial correlation, quadriserial 
correlation, etc., and for serial correlation in general. These for- 
mulas are based on well-known procedures outlined by Kelley, Peters 
and Van Voorhis, and others, and involve Pearson’s correction for 
“broad categories.” The formula for biserial correlation also may be 
developed following these procedures. The assumptions underlying 
serial correlation are that the segmented variable is basically contin- 
uous and normally distributed, and that all the segments which to- 
gether would form a whole normal distribution are present. 


Introduction 

Biserial' correlation has become a standard’ measurement tech- 
nique. Its use is indicated when the data are presented in the form 
of two mutually exclusive categories in respect to one variable and 
in quantitative scores in respect to the other variable. The assump- 
tions underlying such a use are that the dichotomous variable is bas- 
ically continuous and normally distributed, and that the two di- 
chotomies together form a whole norma! distribution. 

Pearson (2, 96-105) is responsible for the first biserial expres- 
sion for a correlation coefficient. Kelley (1, 248) presents the fol- 
lowing simplified form of Pearson’s biserial expression: 


(M. — M,) pq 
Z¢0 ; 





(1) 


Trois = 
The purpose of this paper is to present a development of formu- 
las of serial correlation for any number of categories, and more spe- 
cifically formulas for biserial correlation (following a procedure other 
than that of Pearson or Kelley), triserial correlation, quadriserial cor- 
relation, etc. This development stems from procedures outlined by 
Kelley and by Peters and Van Voorhis. They used these procedures 
to determine correlation coefficients when the variables were seg- 
mented into several categories (1, 167-171; 4, 399-402). They did 
not, however, express the procedures in the form of a formula; nor 
did they use them in their development of the formula for biserial 
r (1, 245-248; 4, 362-364). This article is written principally because 
it would seem desirable that formulas for triserial r, etc., be ex- 
plicitly presented. 


23 








24 PSYCHOMETRIKA 


The effect of serial correlation is to normalize the segmented 
distribution at the time that the correlation coefficient is obtained. If 
the number of segments is large and if the segmented variable is 
already normally distributed, the resulting correlation will be the 
same as a Pearson product-moment correlation. 


Symbolism 
The following symbolism will be adopted: 


Let y be a continuous variable, 
x be a continuous segmented variable, normally distributed, and 
r be the coefficient of correlation (linear) between x and y. 


Let a = the proportion of cases in the top (right-most) segment of 
the x distribution, 
b = the proportion of cases in the second highest segment, 
¢ = the proportion of cases in the third highest segment, etc., 


and 
f = the proportion of cases in the f-th segment of the distribu- 
tion. 
Thenat+b+ec+---=1. 
Let ga. =a, 
@—a-+bd, 


Qa =—at+b+c, ete, 
Q; —=a+b+--- + f =the area above the left boundary of the 
f-th, segment, and 
q;-1=—a+6+--- + up to but not including f = the area above 
the right boundary of the f-th segment. 
Let z,= the ordinate of the normal curve, assuming a unit normal 
distribution, at q., 
Zp = the ordinate of the unit normal curve at q, , etc., 
2; = the ordinate at q; , and 
2; = the ordinate at q;-.. 
Let Y, = the mean of the y’s in the top (right-most) segment of the 
x distribution, 
Y, = the mean of the y’s in the second highest segment, etc., and 
Y, = the mean of the y’s in the f-th segment. 


Let X, = the mean of the z’s in the top segment of the z distribu- 
tion, etc., and 
X, = the mean of the z’s in the f-th segment. 
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Except for a few modifications, conventional symbolism has been 
adhered to. The symbol q is sometimes taken in the literature to rep- 
resent the proportion of cases in one of the segments (i.e., q = 4:—2» 
a decumulated frequency), as well as the cumulation of proportions 
or frequencies from a given line of truncation to the end of the curve. 
In tables of the normal probability integral oriented in terms of q, 
and therefore in this paper, gq always represents the area from the 
given line of truncation to the end of the curve. 

Since the normal curve is symmetrical, the ordinates z are equiva- 
lent for complementary q’s. Consequently most normal tables orient- 
ed in terms of q carry the argument only from zero to .500, and it 
is therefore necessary to consult the complement of q for values of 
q higher than .500. This of course does not disturb the meaning of q. 

In this paper the upper segments of the x distribution (the de- 
sirable pole of the trait or measure in question) are placed to the 
right of the lower (or less desirable) segments of the normal curve. 


General Serial Correlation 
The basic formula for rectilinear correlation is 


_ sey 
ae,” (2), 





r 


The y variable is continuous and all of the values are known. The 
x variable is presented simply as an ordered sequence of categories 
with stated frequencies. On the assumption of a normal distribution, 
class-index values may be assigned each of the categories. 

The means of the respective segments will be taken as the class- 
index values of x. The mean of a segment of a unit normal distribu- 
tion, expressed as a deviation from the mean of the whole distribu- 
tion, is (1, 101; 4, 290): 

X,= 2p — 2a Pr Nit (3) 
G1 — Ua f 

In this formula the q’s are the only independent variables. The 
values z can be determined most readily from a table of the normal 
probability integral oriented in terms of q. 

The use of these means as class-index values representing the 
categories of x permits the computation of the coefficient of correla- 
tion between x and y. Pearson has shown that a coefficient so com- 
puted is in need of correction for “broad categories” (more specifi- 
cally known as the correction for class means) (3, 130, as reported in 
1, 167-171, and 4, 399-402). When the x variable is in terms of a 
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unit distribution and when no correction is necessary on account of 
the y variable, the corrected correlation coefficient is 
7 
peas (4) 
Oz 
In this equation the quantity o, is the calculated standard devia- 
tion of the index values (class means), and is less than unity, the 
assumed standard deviation of the true x values. 
From (2) and (4), and since the distribution is of unit area, 


w= =ry (5) 


oz Oy 





It may be noted that as the number of categories increases, the 
more nearly o,7, and in turn o, , approach unity; and the smaller the 
correction necessary in the correlation coefficient. 

In the case of biserial correlation, .7 will be more specifically 
known aS 1is; 7:r; in the case of triserial correlation ; 7 ouc2 in the case 
of quadriserial correlation; 7,uin: in the case of quintiserial correla- 
tion; etc. More generally it will be known simply as 7. 

In order to find a general formula for serial correlation, it will 
be necessary to determine Siz? and Szy. Sz can be shown to equal 
zero. 

Now 

Sz = aX, + bX, + eX. + + 


= DfX; 
= 31 ( ag ) (6) 


= 3 (2; — 2-) | 
=0. 


S22 = oF = aX? + OX? +6X2 ++: 
= Tf/X? (7) 
(2; — 2-1)? 


f 


Dry = aX,Y, + bX,Y, + X.Y. +: 
—aeet (8) 
= SY; (2; — 2-1). 


From (5), (7), and (8), it follows that: 
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“ei DY 7 (Zs — 2-1) 
(2; — Za)? 
a, f : fale 


It may be of some theoretical interest to express this formula 
in matrix notation. 
Let W and Y be column matrices: 


(9) 














WwW Y 
| 2a | | Y, 
2b — Bu i Y, | 
ne | a 
‘ i 4 
a | 


and let F be a diagonal matrix: 
1 














- 0 0 0 0 
a 
1 | 
0 _ 0 0 |i 
b | 
1 | 
0 0 as | 
| c 
|” 
Fo 0 | 
Then 
SY; (2; =" 2;-)= wy ’ 
and 
aie 2 
5 (2; a2 —wrw. 


Both of these results are matrices of the order 1 X 1. 

Let us indicate the scalar value of these matrices by enclosing 
them in braces, respectively, thus: {W’Y}; {WFW}. 

Then 

Wy 
r= se (10) 
o,{W'FW} 

Biserial Correlation 


Biserial r can be found by substitution in formula (9). It can 
also be found directly from Table 1. 
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TABLE 1 
Calculation of Biserial r 
ST @ Qa 4 Sin x y fa fx fay 
z 2? 
a q 0 z 0 ~ a z eY, 
a a 
4 2? 
b 1 0 Fd —_—- y, —sz — —zY 
q b b b b 





From the last three columns of Table 1, respectively, 
=r=0, 





- Ff f~tor £ 
>2? = 6? = — + —- = = — , 
a b ab ab 
and 
Sry = 2(Ya— Yo). 
Therefore ; 
Y.—Y Y, — Y,)ab 
mae v) _( »)a (11) 
a & Oy 
ab” 


which agrees with (1). 


Triserial Correlation 


Triserial r can be found by substitution in (9) or directly from 
Table 2. 














TABLE 2 
Calculation of Triserial r 
| | | 7 a 7 x y fa fa? fay 
2a Zz, 
a Qs 0 2a 0 eRe Y, %q — ZY, 
a a 
2% (z,—Z,)? 
b 1) q:: 2 2 b Y, 24%, b (z,—2,) Y, 
2p 2,7 
c c 





From the last three columns of Table 2, respectively, 
>7=0, 
2 —2q 2 Zz 2 
gen pat yey 


b Cc 
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Dry = 2aYa + (2% — Za) Yo — 2 Ye. 
Therefore 
' 26¥a t+ (2 — 2a) Yo — Ze 
rT (2 )¥,—% (12) 
E (%— Za)? 2? | 
oy | — + ———_ + — 
a b c 
Quadriserial Correlation 
It can be shown similarly that 
ZaVa + (2p — 2a) Yo + (2c — 2p) Vo — 2c. 
T quad = = = eet - 2 r; — ‘ (13) 
oo [+ st +=] 
a b c d 
Quintiserial Correlation 
It can also be shown that 
ZaYa + (2 — 2a) Yn + (Ze — 2) Ve + (2a— 2c) Ya— 2aYe 
Tquint = ae (z — 2)? (z —%)? (z — 2)? ei . (14) 
Sy |= + b a + c b + ad c +2) 
a b c d € 


Standard Error of Serial Correlation 
In 1914, several years after Pearson published his formula for 
biserial correlation, Soper published formulas for the standard error 
of biserial correlation (1, 248-249; 4, 365). Formulas for the stand- 
ard error of serial correlation in general have not yet been developed. 


Illustration 

Test scores are available for a group of 40 workers, of whom a 
foreman rates 10 outstanding, 24 adequate, and 6 weak. 

The test scores of the workers rated outstanding are: 115, 112, 
108, 104, 103, 100, 95, 92, 86, 85. 

The test scores of the workers rated adequate are: 110, 108, 106, 
105, 101, 100, 98, 95, 95, 92, 90, 90, 89, 88, 87, 87, 85, 84, 79, 78, 76, 
74, 73, 70. 

The test scores of the workers rated weak are: 90, 87, 82, 78, 
73, 70. 

The following statistics can be computed immediately from the 
test score data: Y, = 100; Y, = 90; Y. = 80; o, = 12.1120; a = .25; 
b = .60; c = .15. 
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A table such as Table 3 can be set up to find the other constants 
of formula (12). 











TABLE 3 
Normal Curve Functions 
f q a 
a 25 25 3178 
b .60 85 .2332 
c 15 1.00 0 





By substitution directly in (12), 
(.3178) (100) + (.23382 — .3178) (90) — (.2332) (80) 
3178? ‘ (.23382 — .3178)? ‘ .2332? 
25 .60 15 


Ti5 = 








12.1120 
= .584. 


The obtained correlation should not be accepted as the validity 
of the test without further inquiry into such important factors as the 
validity of the criterion, the rectilinearity of regression, and the 
heterogeneity of the sample relative to that of the applicant popula- 
tion for whom the test is intended. 
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A FACTORIAL ANALYSIS OF OLFACTION 


E. H. Hst* 
THE UNIVERSITY OF CHICAGO 


Twenty-one reagents were rated on a pleasantness scale by 182 
subjects. The tetrachoric correlation cients were determined 
for the variables. The correlation matrix thus obtained was sub- 
jected to a Thurstone analysis which gave rise to a 6-dimensional 
structure. Two of these were not clear-cut, and the last one appeared 
to be a residual factor. One factor, the benzene-ketone, revealed 
the chemical property of unsaturation and the physiological quality 
of possible trigeminal stimulation. The other two factors were 
marked by the presence of oxygen and nitrogen, respectively, and 
were tentatively labelled as a plant factor and an animal factor ac- 
cordingly. The intercorrelations of the primaries are high. 


Introduction 

There seems to be some justification in classifying the tactual, 
the auditory, and to-some extent the visual sensations under the head- 
ing of physical sense and the olfactory and gustatory sensations un- 
der that of chemical sense. Possibly because of the overgrowth of the 
corpus callosum in the human brain the growth of the archipallium 
is necessarily sacrificed. This may possibly explain why little is known 
about the more primitive olfactory sensation in human beings. For 
instance, such attempts at searching for the primary dimensions of 
olfaction as carried out by Zwaardemaker and Henning have never 
been critically accepted, and we still know almost nothing about the 
primary olfactory sensations, and still less about the differential ol- 
factory receptors. 

The present study aims at singling out the primary factors of 
olfaction by resorting to the modern factorial technique, which may 
at least throw light on the number of dimensions of primary olfac- 
tion. Unfortunately, a research on such a scale calls for a large sam- 
pling of olfactory stimuli which are unavailable for our purpose in 
war time. It should be pointed out that since our problem is essen- 





* The author wishes to express his gratitude to Professor L. L. Thurstone for 
his kindness in providing facilities which made this study possible and for his 
guidance on the factorial problem; to Doctor T. G. Andrews for his many in- 
valuable assistances and technical advice on the experimental aspect of this prob- 
lem; to Miss Florence Brown for her editorial assistance; and to Mrs. Virginia 
Brown for her generous assistance on the computational procedures. 
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tially a physiological one more objective measures than psychological 
ratings are desirable. Perhaps such measures as the adaptation rate, 
threshold values, and the like are the ideal variables for isolating the 
primary olfactory factors. To carry out such experiments on a large 
group of individuals with a large number of stimuli would call for 
a considerable length of time, however. ‘Time at our disposal did not 
allow us to go into such elaborated experiments. We therefore re- 
sorted to attacking this problem by making use of the psychological 
reaction towards a relatively small sample of stimuli as a good pre- 
liminary experiment. 

An earlier factorial study on the olfactory problem was carried 
out by Beebe-Center (1). Using Spearman’s method of tetrad dif- 
ferences he found a general factor from the correlations of rankings 
of fourteen odors by eight subjects. Due to such a small sample and 
his factorial approach, the result was not significant. A more recent 
work of Eysenck (8) made a reanalysis of the Beebe-Center data. In 
addition to the reanalysis Eysenck used fourteen subjects, eight men 
and six women, to arrange thirty-one smells according to a scale of 
predescribed frequencies for scores from 0 to 10. The stimuli were 
first sorted into three groups, namely, the definitely pleasant, the def- 
initely unpleasant, and the indifferent. Then a score of 10 was as- 
signed to the best-liked odor and a score of 9 to the next two best and 
so on. Burt’s least squares method was applied to the resulting correla- 
tion matrix between persons, and factor saturations were obtained. 
A general factor was found, while the residual coefficients were not 
significant. Eysenck attributed this general factor to biological rea- 
sons. She also presented the average ratings for each grouping of 
Zwaardemaker and believed the hedonic tones thus expressed were in 
agreement with his classification. 

Some less recent works connected with the olfactory problem are 
briefly reviewed here. A series of works were conducted by Elsberg 
and his associates (2, 3, 4, 5, 6, 7). Two new methods of quantita- 
tive measurement of olfaction were invented. One was called the 
“blast injection” method. In this method air is injected into the pa- 
tient’s nostrils, either monorhinally or birhinally, through a rubber 
tubing and a nosepiece by means of a tank of compressed air. The 
bottle containing the reagent is connected with the tank by rubber 
tubing at one end and with the nosepiece at the other. Thus the vol- 
ume of air injected into the patient’s nasal passages can be measured. 
The intensity of the odor is thus controlled by allowing a definite 
amount of air to pass through the mucous membrane of the nostrils. 
The patient is asked to hold his breath while the air is injected. By 
this method Elsberg determined the thrishold values of some twenty- 
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one odors. The other technique was the so-called “stream injection” 
method. This was devised for determining olfactory fatigue. In this 
method a continuous stream of air is injected into the patient’s nos- 
trils until the patient can no longer notice the odor. While the air is 
being injected he is asked to breathe through his mouth. Elsberg ob- 
tained from patients the olfactory fatigue data for coffee by a bi- 
rhinally injected stream of coffee odor for 1.5 to 2.0 minutes at a vol- 
ume rate of 6000 cc to the minute. He found that the olfactory fa- 
tigue of coffee did not interfere with the ability of the individual to 
reproduce by an effort of memory the sensation of that odor; however 
the reproduction necessitated a greater effort than that normally 
demanded. Elsberg assigned the identification of odor to the olfactory 
cortex and reception and perception to the thalamus or to its neigh- 
boring area. He also found that if the olfactory membrane was stim- 
ulated by an odor at the moment when the individual was concentrat- 
ing on the same odor a smaller volume of odor than that normally 
required could be identified. 


The Procedure of the Experiment 

A preliminary list of thirty chemical reagents was made accord- 
ing to the following criteria: (1) They had fo be relatively free from 
trigeminai stimulation as based on Von Skramlik’s findings (12). 
(2) They had to be chemically pure. (3) They had to represent a 
large variety of olfactory quality. Only twenty-two reagents, however, 
were available. One of these, mercaptan, was finally discarded due 
to its extremely offending nature. The list of the twenty-one reagents 
finally used in this experiment is as follows. 


1. 0-Eugenol 2. Benzaldehyde 3.a-Ionone 4. Die- 

thyl sulfide 5.Indol 6. Naphthalene 7. Eucalyptol 

8. Anethol 9. Acetone 10. Trimethyl amine 11. 

a-Pinene 12. Geraniol 13. a-Terpineol 14. Skatol 

15. 0-Xylol 16. Linalo6l 17. Toluene 18. Amy] ace- 

tate 19. Methyl salicylate 20. Limonene 21. Py- 

ridine 
These twenty-one reagents were then bottled in identical 500-ce con- 
tainers, each with a glass stopper. The quantities of the liquid re- 
agents varied from 2 cc (trimethyl amine) to 10 cc. The majority 
were 10 cc. Diethyl sulfide and pyridine were about 5 cc each, and 
indol and skatol were about 5 grams. 

With the exception of a few cases in which the experiment was 

conducted individually, the odors were given to the majority as a 
group experiment. The number of individuals in each group varied 
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from two to twenty-five. In most cases the group was small, with 
about four persons at a time. The following data sheet was given 
each subject: 


Name (optional) : Age: Sex: MF 
Check the fellowing: 
How many cigarettes do you smoke a day? 0 1-5 6-10 11-15 16-20 
Over 

Have you smoked within one hour before this experiment”? Yes No 

Do you have a cccld? No Slightly Definitely 
Instruction: 

You will be handed a stoppered bottle on which there is a number. 
At a signal from the experimenter you are to uncork the bottle, smell 
the cork and immediately restopper the bottle. As you smell the cork, 
pay particular attention to the relative pleasantness or unpleasantness 
of the odor. On the graded blank given you, identify the degree of 
pleasantness at which the odor was judged by you. Mark the scale for 
the number of the bottle smelled, in the proper column. 

At the next signal from the experimenter pass the bottle to the next 
subject and receive the next numbered bottle, waiting for the signal to 
unstopper and smell, and evaluate as described above. 

It is of utmost importance that you take this experiment seriously 
and attempt to give your best judgments concerning the odors. For your 
cooperation as subjects in this research project the experimenter and the 
staff of the Department of Psychology are extremely grateful. 





Reagent No.| Unpleasant Slightly Indifferent Slightly Pleasant 
unpleasant pleasant 





1 























3 





A section of the graded blank which covered the twenty-one re- 
agents is shown above. In addition to this written instruction the 
following oral instruction was given all subjects: “When you open 
the bottle, you are to open it little by little until you smell a distinct 
quality of odor. But be sure you smell a distinct odor, not any diffused 
odor attached to the outside of the bottle. If you cannot get a distinct 
odor, then smell the cork. If you have two different impressions of 
the same odor, then take your first impression as the basis for judg- 
ing the quality of the odor. Always sniff just slightly until you get 
the distinct quality of the odor. Don’t smell deeply any odor. If you 
cease to smell any odor at all, please tell me and take a little rest.” 
This additional instruction was given in order to avoid adaptation 
and to keep the laboratory free from odors leaking too much from the 
bottles while they were being opened. The laboratory windows were 
always opened in such a way that good ventilation was assured. The 
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bottles containing the odors were put far from the subjects before 
and after the main procedure. The numbered bottles were given en- 
tirely in random order, so as to eliminate the constant error of con- 
trast of the hedonic tone. When the group was small, the experi- 
menter uncorked the bottles for each subject in order to avoid adap- 
tation as much as possible and to give a relatively lengthy interval 
for each subject between smellings. It took about ten minutes on the 
average for each subject to finish the whole series of odors. 

The subjects included in this study consisted of 182 individuals, 
71 males and 111 females. With a few exceptions the subjects were 
students, the majority of them taking courses in psychology. The 
ages of the subjects ranged from 15 to 53 years, with a median of 23. 
Of 182 subjects 102 reported being non-smokers, 76 reported having 
slight colds and 48 reported having smoked within one hour of the 
experiment. 


The Result 

A brief analysis of the factors of sex, smoking habits, and colds 
did not reveal any significant difference as far as the ratings of the 
olfactory stimuli were concerned. Since the average age of the sub- 
jects was about 22 and the variation was small, the age factor seemed 
also insignificant. The frequencies of ratings of each reagent on the 
5-point scale were obtained first, and then the scale was dichotomized 
near the median. Since the scale was considered linear, by such dichot- 
omizing a four-fold tetrachoric table for each paired variable could 
easily be determined. Then the tetrachoric correlation coefficients 
were obtained according to Thurstone’s diagrams (11). The corre- 
lation matrix so determined is presented in Table 1. 

The correlation matrix was then subjected to a centroid analysis 
with Professor Thurstone’s new method (9). After the 6th factor 
had been extracted the residuals appeared negligible. None of the 
residuals was greater than .20, the standard error for a zero tetra- 
choric correlation in our case being about .21. The centroid matrix 
is given in Table 2. The residuals are shown in the upper half above 
the diagonal in Table 1. 

The rotation was carried out six times for all planes with the 
method of real length of vectors on a matrix multiplier machine. 
Plane A was then rotated with Professor Thurstone’s newly invented 
rotation method of least squares three times (10), Planes C and B 
four times, Plane E three times, and Plane F one time. It was found 
that the new rotation method proved to be the most powerful tech- 
nique yet discovered. In comparison with the older methods it saved 
a considerable number of rotations. The transformation matrix is 
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presented in Table 3. The simple structure is given in Table 4. 
From Table 5 it can be seen that the primaries are highly cor- 
related. There possibly exists some second-order factor. 


Interpretation of Result 

First it should be pointed out that the nature of rating in our 
experiment is essentially different from hedonic ratings on visual or 
auditory stimuli in aesthetic experiments. All of the previous work- 
ers, such as Beebe-Center and Eysenck, seem to have treated olfac- 
tory ratings and visual and auditory aesthetic ratings alike. They 
appear to us to differ in three ways: (1) In an aesthetic rating one 
may measure or describe some more fundamental quality in vision or 
audition, such as hue or pitch, besides a pleasantness or unpleasant- 
ness rating. But there is no other quality of olfaction more funda- 
mental than pleasantness or unpleasantness. (2) The aesthetic ratings 
on visual or auditory stimuli do not ordinarily give rise to such physio- 
logical reaction as nausea or headache, but the odors do if they are un- 
pleasant. (3) The reaction to odors is more spontaneous than that 
towards visual or auditory stimuli. In other words the variance of 
aesthetic rating on visual or auditory stimuli is larger than that of 
olfactory rating. For example, out of a number of lines one may 
either choose a line of three inches or one of two and a half as his 
preferred one, but he is almost bound to be offended by such an odor 
as mercaptan, if his olfactory receptors are not too different from 
those of the majority of people. Thus it seems quite logical for us to 
use the pleasantness-unpleasantness ratings on the olfactory stimuli 
to search for their more fundamental qualities and primary dimen- 
sions. 

It is to be noted in Table 6 that a positive manifold definitely 
exists in our simple structure. This fact leads us to believe that the 
olfactory correlation matrix resembles the correlation matrix of scores 
on intelligence tests more than do correlation matrices of person- 
ality traits, in which a positive manifold in general cannot be dem- 
onstrated. 


Factor A reveals these following variables: 


—Acetone ............ NS... —WTelmene. .nc2tax... 57 Diethyl sulfide .40 
DDI 65 ckscsscietiasee .73 Naphthalene ........ 51 


Amy] acetate and indol also appeared in this group but had relatively 
low loadings. An inspection of these variables would reveal only two . 
derivatives, namely benzene and ketone, with the exception of diethyl 
sulfide. The appearance of the thioether is probably due to some er- 
ror made in the experimental procedure under which the subjects 
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might not have really smelled the quality of this chemical but got some 
diffused odor attached to the outside of the bottle instead. Neverthe- 
less, it should be pointed out that all these reagents with high load- 
ings have something definitely in common psychologically, since they 
all give rise to a biting sensation, which is possibly due to the stimu- 
lation of the trigeminal nerve. Chemically, benzene and ketone are 
not alike, yet it is interesting to note that in this factor all the re- 
agents have double-bond in their structures ; hence all are unsaturated. 
The aromatic hydrocarbons are also grouped together by their load- 
ings, which reveal their chemical resemblance. 


Factor B consists of the following variables: 


Linalo@l ............ .50 Methyl salicylate .43 Ionone .............. 31 
Benzaldehyde .. .49 O-Eugenol ............ 43 
Anethol ............ 47 Geraniol ................ 35 
Terpineo) .......... 46 Amy! acetate ...... 35 


The striking feature of this factor is the presence of oxygen in all 
the reagents. Half of them are alicyclic compounds and practically 
all are constituents of plants. If one examines the structures of these 
chemicals one sees a single oxygen atom either attached to the carbon, 
or in a hydroxy] group, or combined with the methyl group. The oxy- 
gen common to all these reagents is responsible for the pleasant odor 
of this group. Thus this factor is both chemically and psychologi- 
cally meaningful. We tentatively label this factor as a plant factor 


The following variables are found in Factor C: 














/\ Toluene 54 Diethyl sulfide ...................... 36 
Pinene 51 Pyridine . 33 
~-Linalodl ....... iategsnseprenpeletate 48. Terpineol .26 
* Limonene 44 ._ Geraniol .25 








This factor is less clear-cut than the others. Possibly the last 
two reagents in this factor are not significant, since their loadings 
are low. The double-bond in their structures seems to be the only 
chemically common nature of these reagents. This factor thus con- 
sists of two groups, benzene (if pyridine is considered a derivative 
of benzene) and alicyclic compounds which appear to be a combina- 
tion of Factor A and Factor B. 


Factor D includes the following variables: 











y O-Evgenol 42 Eucalyptol . 84 
¢_,Pinene . aes .40 Trimethyl amine .................... 83 
y) Methyl salicylate .................. 40 =) Limonene WW... eee 30 

Skatol 36 it | Aaa ie ERR te Ste Bak OSC we EA .24 





The variables in this factor can be dichotomized chiefly into the 
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alicyclic and heterocyclic compounds. It is interesting to note that 
nitrogen is present in the heterocyclic compounds and in trimethyl 
amine. Nitrogen is connected with protein, which is characteristic 
chiefly of animals. This factor seems to indicate a combination of plant 
and animal. 


The following variables are found in Factor E: 


6 ORES ae ee 62 TET U1) (Re eae as ie 48 
oc, LEER CATES, .60 Trimethyl] amine .................... 36 
“Naphthalene .......................... 49 Meee? ecto: ounce secs sce 32 

Benzaldehyde  ........................ 38 . . Methyl salicylate ................. .28 


The structures of indol and skatol are given as follows: 


Indol Skatol 


—CH; 
H H 
H 


It is to be noted that the three compounds possessing nitrogen 
again appear in this factor. The resemblance of indol and skatol, 
chemically, physiologically, and psychologically, is revealed by their 
remarkably high loading on this factor. We have included only four 
reagents possessing nitrogen—indol, skatol, trimethyl] amine, and py- 
ridine, all of which are directly or indirectly related to animal con- 
stitution. The fact that pyridine or bone oil did not appear in this 
factor clearly indicates the possibility of grouping the other three to- 
gether. It is highly possible that an exclusive factor of composed re- 
agents possessing nitrogen would have appeared had we only includ- 
ed more such compounds in our sample. However, in the present case 
it seems safe to label this factor as an animal one because of the two 
highest loadings on this factor. 


The following reagents appeared in Factor F: 





PORE ReNOl nn... ne ncseennneeses 57 Benzaldehyde ...................-..-. .28 

‘Methyl salicylate ~................. 37 eRe eae ae RT SONS 26 
| Dee ee ae 326 /. Acetone -26 
,, Trimethyl amine .................... 34 Anethol 25 





There is only one high loading on this factor. The picture of 
this factor is not clear-cut, and it seems to have all the preceding 
factors combined in it. Thus it is not interpretable and is most likely 
just a residual factor. 
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In further studies it would be interesting to score such individuals 


marked deviation from the majority rating of degree of pleasantness. 
according to the regression weights derived from the factors. 


totally insensitive to some reagents, for instance, indol, or had a 
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Transformation Matrix, A 


A 


.53 
—.48 
—.32 

54 
—.25 

18 


B Cc 
5 46 
14 —08 
12 ~=«.68 
21 —.05 
24 —.55 
—29 3 


D E 

38 52 
—.01 —.16 
01 —.17 
—86 —.48 
—.26 65 
—.21 15 
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TABLE 2 
Centroid Matrix 
I II Ill IV V VI 
.29, 40 —24 —21 —.28 —.31 
42 14 11 06 40 —.18 
.09 11 .26 18 12 —.31 
32 —.27 ee | 28 —.06 15 
45 —33 —14 —.19 32 07 
04 —17 —31 .05 16 ool 
.26 19 —.383 —.40 13 81 
06 328 —.13 18 —.12 —.07 
48 —24 —.49 38 —.23 —.09 
16 —.27 12 —.30 25 —.35 
26 —.06 27 —30 —.18 21 
20 pe 10 22 —.14 14 
Bf 35 OT 13 19 = ik f 
28 —.39 13 —.35 mAh mT 
42 —.48 —.36 32 —14 —.25 
43 38 33 10 —.13 12 
49 —.27 10 17 —.37 14 
AT 09 —.11 21 23 15 
AT 40 —48 —22 —11 —.07 
By 05 10 —.14 —.33 15 
20 —.35 48 09 .06 —.20 
TABLE 3 


F 


39 
10 
—.11 
—.11 
—.10 
—.89 
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TABLE 4 


Simple Structure 


O-Eugenol ................ 


BOWMORE: 20e.. AivcnccstBincess 
Diethyl sulfide ........ 
ee ee 
Naphthalene ............ 
Eucalyptol ................ 
Amethol .. <i. ..c0.-cccccsssedh 
Acetone  .................... 
10. Trimethyl amine .... 
A: So See 
12... Geraniol. .cccic.ciccccsccn 
13. Terpineol .................. 
34. SHBG) <2 csecctcseda. 
: ES 6 > (5) Ue 
26... TANGIOO!  ics.ccssce.ss..c:: 
RG. DOMWGRE 2. .ccc2 secs 
18. Amyl acetate ............ 
19. Methyl salicylate .... 
20. Limonene .................. 
21. Pyridine: «................... 


SO DAM oR wy 


Benzaldehyde ............ 


A B 
—06 39 
02 49 
—08 31 
40 —02 
27 —01 
51 07 
—04 09 
14 47 
75 03 
—1l1 00 
05 00 
11 38 
—09 46 
11 —12 
73 —08 
05 50 
57 —03 
32 35 
09 43 
15 06 
12 —01 
TABLE 5 





E F 
00 51 
38 28 
—07 26 
05 —08 
62 08 
49 —07 
48 —09 
00 25 
00 26 
36 34 
21 —03 
—12 —01 
11 —08 
60 01 
07 36 
—01 06 
—02 05 
32 02 
28 37 
02 08 
04 15 


Cosines of Angular Separations Between Reference Vectors (A’A) and Primary 


Vectors (D(A’A)-1D) (Correlations between reference vectors are 
below the diagonal cells.) 


} A B 
A’ 1.00 51 
Bo 618 (1.00 
C 2A 03 
yp —B 00 
E’ 01 14 


4 .00 AT 


Cc 


—.60 
—.52 
1.00 
31 
—.15 
—.07 


D 


64 
62 
—.73 
1.00 
Al 
745 


E 
—.50 
—.52 

62 
—.712 
1.00 
.06 





42 


10. 


12. 





PSYCHOMETRIKA 


REFERENCES 


Beebe-Center, J. G. Pleasantness and unpleasantness. New York: Century, 
1983. 

Elsberg, C. A. and Levy, I. The sense of smell. 1. A new and simple method 
of quantitative olfactometry. Bull. Neurol. Institute of New York, 1985, 4, 
5-19. 

Elsberg, C. A. and Brewer, E. D. The sense of smell. II. A new principle 
for the classification of odors based upon their olfactory coefficients. Bull. 
Neurol. Institute of New York, 1985, 4, 20-25. 

Elsberg, C. A. The sense of smell. VIII. Olfactory fatigue. Bull. Newrol. 
Institute of New York, 1935, 4, 479-499. 

Elsberg, C. A. and Brewer, E. D. The sense of smell. X. A detailed descrip- 
tion of the techniques of two olfactory tests used for the localization of su- 
pratentorial tumor of the brain. Bull. Neurol. Institute of New York, 19865, 
4, 501-510. 

Elsberg, C. A. The sense of smell. XIV. The relation of the cerebral cortex 
to the olfactory impulse and the area of brain involved in fatigue of the 
sense of smell. Bull. Neurol. Institute of New York, 1987, 6, 118-125. 
Elsberg, C. A. and Stewart, J. Quantitative olfactory tests. Archives Neurol. 
and Psychiatry, 1938, 40, 471-481. 

Eysenck, M. D. An experimental and statistical study of olfactory prefer- 
ence. J. exp. Psychol., 1944, 34, 246. 

Thurstone, L. L. Multiple factor analysis. (In press) 

Thurstone, L. L. A single plane method of rotation. (In press) 

Thurstone, L. L. et al. Computing diagrams for tetrachoric correlation co- 
efficients. Univ. of Chicago Bookstore, 1935. 

Woodworth, R. S. Experimental psychology. New York: Holt, 1938. 











PSYCHOMETRIKA—VOL. 11, NO. 1 
MARCH, 1946 


GRAPHICAL AND TABULAR AIDS FOR DETERMINING 
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WHICH INVOLVE COMPARISONS 
OF PERCENTAGES 
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When one knows in advance the minimum percentage difference 
whjch will be of practical interest in an investigation, principles of 
economy suggest the desirability of an accurate estimate of the size 
of the sample to be used. Charts are presented which eliminate any 
computation where the two groups involved are to be equal. A table 
is included for the case where one group will be from one to three 
times the size of the other. The charts are also useful for determin- 
ing whether or not obtained differences are statistically significant 
at the 5 per cent or the 1 per cent level. 


In planning an experimental investigation it is desirable to de- 
termine in advance the number of cases which are necessary in or- 
der that certain results shall be considered statistically significant. 
An underestimate leads to inconclusive results; an overestimate may 
entail needless labor and expense. Such a problem recently came to 
the writer’s attention. In a particular investigation percentage dif- 
ferences of ten or fifteen points were considered meaningful; that 
is, in this instance only differences of at least ten or fifteen points 
would be of practical value, and the use of groups large enough to 
establish the statistical reliability of differences smaller than these 
would not be necessary. For how large a sample are differences of 
this magnitude significant at the 5 per cent level? at the 1 per cent 
level? If the nature of the data is such that all percentages tend to 
be very high or very low, then how much smaller may the groups 
be if they are still to yield statistically reliable results? 

Examination of published tables and graphs which short-cut com- 
putations for comparing frequencies or percentages failed to reveal 
a ready answer to this particular problem.* Most of the graphs are 
designed for the experimenter who has his data collected and his num- 
ber of cases, therefore, fixed. A change in the number of cases neces- 


* For a comprehensive list of such references see Donald W. Fiske and Jack 
W. Dunlap. A graphical test for the significance of differences between frequen- 
cies from different samples. Psychometrika, 1945, 10, 225-29. 


43 








44 PSYCHOMETRIKA 


sitates a new graph. The advantage of the charts to be presented 
here lies in the fact that the number of cases, as well as the percent- 
ages and the differences, is a variable. The fixed values are the criti- 
cal ratios, which have been selected at the 5 per cent and the 1 per 
cent points. 

The discussion which follows is written in terms of proportion 
instead of percentage. If, as is most frequently the case, the null hy- 
pothesis to be tested is that no population difference exists, then the 
appropriate estimate of the population proportion is 


Nip, + Nope 
N,+Ne- 


where p without a subscript is the weighted average of the propor- 
tions, p, and p., for the two groups, and N, and N; are the numbers 
of cases involved. The square of the standard error of either ob- 
served proportion is pg (where g—1—p) divided by the correspond- 
ing number of cases, and the standard error of the difference 
(d=p,—p.) is found by means of the formula, 





p= ; (1) 








cage (2) 
a— pq NN. 
The critical ratio is computed from 
—- N,N, a? 
CR? = ; (3) 
If N./N, be denoted by k, then formula (3) may be written 
— kN, d? 
emcee, (4) 
(1 + k)pq 
from which 
(1+ k) pq CR? 
= ; 5 
} kd? (5) 
The total number of cases is then given by 
1 + k)*pq CR? 
akin doe, eS. (6) 





kd? 


Many problems, such as those dealing with test-item evaluation, 
involve equal groups, in which event k is equal to 1, and formula (5) 
reduces to 
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2 nq CR? 
ue. 
ad? 


For a given critical ratio and a given difference, d, the value of N 
reaches a maximum when p = q = .50, and becomes smaller as p 
approaches 0.00 or 1.00. Accordingly, values of N have been computed 
for values of pq corresponding to p, = .08 to p, = 1.00, where p, 
represents the larger proportion, and for values of d from .08 to .30. 
This has been done at the 5 per cent level (CR = 1.960) and at the 1 
per cent level (CR = 2.576). The results are presented in graphical 
form in Charts 1 and 2. Thus, for example, from Chart 1 it can 
readily be seen that any difference of 10 per cent is statistically sig- 
nificant at the 5 per cent level only if there are at least 192 cases in 
each grqup. From Chart 2, on the other hand, it is found that for 
192 cases in each group a difference of 10 per cent is significant at 
the 1 per cent level only if the larger proportion is about .23 or 
smaller or about .87 or larger. Otherwise up to 332 cases would be 
required in each group. 

The charts can also be used to determine whether or not an ob- 
served difference is significant at either of the critical levels for which 
the charts were drawn. Usually one wishes to know only the rela- 
tion of the observed data to the 5 per cent or the 1 per cent point; 
the exact amount of deviation from the critical point is less impor- 
tant. Suppose, for instance, one has the following data: N, = N. = 60, 
), — .85, Po = .18. The point on Chart 1 for p, = .35 and N = 60 is 
approximately on the curve for d = .16. The observed d of .17, there- 
fore, is statistically significant at the 5 per cent level. The corre- 
sponding point on Chart 2, however, is very near the curve for d = .20. 
Hence the observed difference does not reach the 1 per cent level of 
significance. (The actual critical ratio for this example is 17/.0806 
= 2.11.) 

The problem is somewhat less simple when the two groups differ 
in size. Minimum values of N, and N, have been computed for selected 
values of k and d by employing formula (5) with pq = .25 and with 
CR = 1.960 for the 5 per cent points and CR = 2.576 for the 1 per 
cent points. These values, given in Table 1, correspond to the maxi- 
mum points on curves similar to those in the charts. Indeed, the last 
line of Table 1 gives the maximum points on the curves in the charts. 

In order to make use of the table it is assumed that the investi- 
gator knows approximately the relative sizes of the groups to be com- 
pared and the minimum percentage difference which would be of 
practical import. For example, he may be sampling a college popula- 
tion in which there are about twice as many freshmen as seniors, and 


(5a) 
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he may wish to obtain a sufficiently large sample that a difference of 
-20 would be statistically significant at the 1 per cent level. Since 
one group is twice as large as the other, he would enter the table in 
the row corresponding to k = 2.0, and he would find the bold face (1 
per cent) values of N, and N, in the column for d = .20. These val- 
ues are 62 and 124. He will therefore require 186 cases, of which 
124 are freshmen and 62 are seniors. It is not so important that he 
use these exact numbers as it is that he take care not to make an un- 
duly large under- or overestimate in planning his investigation. 

If it should be known in advance that the observed proportions 
will lie near one of the extremes, so that pq will not exceed some 
fraction of its maximum value of .25, say pq/.25 < m, then the mini- 
mum sample necessary for significant results is m times that entry 
in Table 1 which corresponds to the k and the d under consideration. 
For example, suppose the items on a particular questionnaire to be 
consistently answered in a certain way by 80 to 100 per cent of the 
respondents. Suppose further that a percentage difference of 8 or 
more is an important difference, and that one of the groups under 
comparison is 1.5 times as large as the other. For k = 1.5 and d = .08, 
Table 1 indicates that a total group of 432 + 648 = 1080 cases would 
be required for significance at the 1 per cent level when pq = .25. 
In the present example, however, the maximum pq will be only .16, 
from which m is .64. The product, (.64) (1080), gives 691 cases as 
the total number required for the experiment. There should then be 
276 cases in one group and 415 cases in the other. 

With regard to interpolated values from the table, it should be 
pointed out that the only linear relation is that for successive N.’s in 
a column. Since N, changes slowly, however, vertical linear inter- 
polation for N, will not introduce much error. This value can also be 
determined from the expression, N, = N./k. Horizontal linear in- 
terpolation will tend to overestimate the N’s by a small number of 
cases. 

Both the charts and the table should serve to emphasize the im- 
portance of an adequate sample in any investigation which involves 
comparison of percentages. It is obvious that, unless one is prepared 
to ignore differences of less than 20 percentage points, a total group 
of more than 100 cases is necessary for significance at the 5 per cent 
level and one of more than 200 cases, at the 1 per cent level. In the 
event that smaller differences have meaning for the experimenter, he 
can establish their existence beyond reasonable doubt only by em- 
ploying very large samples. 
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N, C4 =, | | 
\N, N,|N, Np|N, N,|N, No| N, No| Ny No |N,N,|N,N|N, Np 
! | | | 

| 

8.0 | 200 600 | 128 384 89 267 65 196 50 150 40 119 | 82 96 | 20 61 | 14 438 
| 346 1037 | 221 663 | 154 461 | 113 339 86 259 68 205 | 55 166 | 35 106 | 25 74 
| 

2.8 | 204 570 | 130 365 91 253 66 186 51 143 40 113 | 38 91 | 21 68 | 14 41 
| 352 985 | 225 630 | 156 438 | 115 322 88 246 69 195 | 56 158 | 36 101 | 25 70 
| 

2.6 | 208 540 | 183 346 92 240 68 176 52 135 41 107 | 33 86 | 21 55 | 15 38 
| 359 933 | 230 597 | 159 415 | 117 305 90 233 71 184 | 57 149 | 37 96 | 26 66 

2.4 | 218 510 | 136 327 94 227 69 167 53 128 42 101 | 34 82 | 22 52] 15 36 
| 367 881 | 235 564 | 163 392 | 120 288 92 220 73 174 | 59 141 | 38 90 | 26 63 
| 
| 

2.2 | 218 480 | 140 307 97 213 71 #157 55 120 48 95 | 35 77 | 22 49 |] 16 84 
377 829 | 241 531 | 168 369 | 123 271 94 207 74 164 | 60 133 | 39 85 | 27 59 

2.0 | 225 450 | 144 288 | 100 200 73° «147 56 113 44 89 | 36 72 | 23 46/16 382 
| 389 778 | 249 498 | 173 346 | 127 254 97 194 77 #154 | 62 124 | 40 80 | 28 55 
| 

1.9 | 229 435 | 147 279 | 102 193 75 142 57 109 45 86 | 37 70 | 28 45 | 16 $1 
| $96 752 | 253 481 | 176 334 | 129 245 99 188 78 148 | 63 120 | 41 77 | 28 53 
| 

1.8 | 283 420 | 149 269 |] 104 187 76 187 58 105 46 83 | 37 67 | 24 438 | 17 30 
| 403 726 | 258 464 | 179 323 | 132 237 | 101 181 80 143 | 65 116 | 41 74 | 29 52 
| 

iT | 288 405 | 1538 259 | 106 180 78 1382 60 101 47 80] 38 65 | 24 41 |17 29 
| 412 700 | 263 448 | 183 311 | 134 228 | 103 175 81 138 | 66 112 | 42 72 | 29 50 
| 

1.6 | 244 3896 | 156 250 | 108 173 80 127 61 98 48 77 | 39 62 | 25 40/17 28 
| 421 674 | 270 431 | 187 299 | 138 220 | 105 168 83 133 | 67 108 | 43 69 | 30 48 
| 

1.5 | 250 375 | 160 240 | 111 167 82 122 63 94 49 74] 40 60 | 26 88] 18 27 
| 432 648 | 276 415 | 192 288 | 141 212 | 108 162 85 128 | 69 104 | 44 66 | 31 46 
| | 

1.4 | 257 360 | 165 230 | 114 160 84 118 64 90 51 71] 41 58 | 26 87] 18 26 
| 444 622 | 284 398 | 197 276 | 145 203 | 111 156 88 123 | 71 100 | 45 64 | 82 44 
| 

1.38 | 265 345 | 170 221 | 118 153 87 113 66 86 52 68 | 42 55] 27 86 | 19 26 
| 459 596 | 293 382 | 204 265 | 150 195 | 115 149 91 118 | 73 95 | 47 61] 33 42 
| 

1.2 |! 275 880 | 176 211 | 122 147 90 108 69 83 54 65 | 44 53 |] 28 84] 20 23 
| 475 570 | 304 365 | 211 253 | 155 186 | 119 143 94 113 | 76 91 | 49 58 | 84 41 
| 

1.1 | 286 315 | 1838 202 | 127 140 94 103 72 #79 57 62] 46 50] 29 82] 20 22 
| 495 544 | 317 348 | 220 242 | 162 178 | 124 136 98 108 | 79 87] 51 56 | 35 39 
| 

1.0 | 300 300 | 192 192 | 183 183 98 98 75 #675 59 59 | 48 48 | 31 81] 21 21 
| 518 518 | 332 332 | 230 230 | 169 169 | 130 130 | 102 102 | 83 83 | 53 53 | 37 87 
| 
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A SIMPLE GRAPHICAL METHOD FOR ORTHOGONAL 
ROTATION OF AXES 


WAYNE S. ZIMMERMAN 
UNIVERSITY OF SOUTHERN CALIFORNIA 


A simple method of making orthogonal rotations by projecting 
coordinates from one plot to another is presented. An easily con- 
structed apparatus for rapid and accurate work is described in de- 
tail. By the technique proposed 12 to 15 rotations of pairs of axes 
penton 20 test variables can be completed per hour by a practiced 

echnician, 


A principal source of resistance’against applying the technique 
of factor analysis is that it is overly time consuming. In particular, 
rotation to simple structure involves considerable computational work. 
This is especially true if the method of rotation used involves one ro- 
tation at a time with computation of new coordinates after each ro- 
tation. The method herein described is one by which the time in- 
volved in the plotting and rotating of axes can be reduced to a mini- 
mum. By the method proposed, disregarding planning time, approxi- 
mately 15 pairs of reference axes each involving 20 test variables can 
be rotated per hour by an experienced operator. 

The technique described is a graphic one for orthogonal rotations 
and makes use of the principle of projection. Coordinates represent- 
ing the test loadings in one factor are projected horizontally, and co- 
ordinates representing the test loadings on the factor with which the 
first factor is to be rotated are projected vertically, to a common meet- 
ing point. Each meeting point represents the position of a test on 
the two axes to be rotated. Oblique rotations can be introduced after 
orthogonal transformations have been completed. 


The Apparatus 


An apparatus for rapid and accurate projection of points can be 
constructed simply by making use of a 24” X 30” drawing board, a 
20” X 30” sheet of ten-squares-to-an-inch graph paper, a 30” T square, 
and an 18” 30°-60° triangle. Modifications of these instruments to 
adapt them to this use are described below. See Figure 1. 

The graph paper is lined up with the sides of the board by use of 
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the T square, and then fastened securely to the surface of the board.* 
The T square lies horizontally along the board, with the arms snugly 
against its left-hand edge. Attached to the upper edge of the T square 
and extending 12 inches from the upper arm is a length of transpar- 
ent celluloid or plastic on which is scratched a hairline running par- 
allel with the edge of the T square. The hairline can be scratched 
with a scribe and can be blackened by running a ruling pen filled 
with India ink down the groove.+ 

Along the short leg of the triangle, and parallel to the edge, is 
drawn another hairline, which is a direct continuation of the line 
on the attachment to the T square. Along the other leg of the tri- 
angle, and parallel to the edge, is a third hairline. At the point of 
intersection of these two lines on the triangle, a hole is drilled large 
enough to allow a pencil point to be inserted.t The hole should be 
countersunk from the top to facilitate the entrance of the pencil lead 

*The graph paper may be mounted on a separate flat surface for initial 
plotting and final reading purposes. In this case one or two horizontal lines and 
one or two vertical lines, drawn in appropriate positions on the rotating board, 
will suffice for guides to Jocate the plots. By this system the plots can be ad- 
justed more quickly since less perceptual distraction is introduced. For best re- 


sults the board should be painted white and the lines drawn with black ink. 
+ All lines should be drawn on the under side to avoid effects of parallax 


in reading. F 
t Use of automatic pencil is desirable as the hole can be drilled to fit the con- 


stant diameter of the lead. 
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and also slightly countersunk from the bottom to keep lead which 
may cling to the edge of the hole from smearing the paper when the 
triangle is moved. Next to the pencil hole an opening can be cut large 
enough to record the number of the variable represented by the point. 
A knob attached above and to the right of the hole will be found 
helpful in making triangle adjustments. A similar knob may be placed 
on the T square. 

Letter-size onionskin paper is used for plotting. To facilitate 
manipulation of these sheets of paper, it has been found advisable 
to punch notebook-size holes in the upper-right and lower-left corners 
and to place over each hole a piece of masking tape. When the sheet 
is adjusted on the board, pressure over each hole with the thumb or 
finger will keep the sheet of paper in place. The taped holes will re- 
main adhesive for several days after the removal of the shect from 
the board.* 


Procedure 


To start the rotations the centroid loadings must be plotted from 
the numerical values. Only nonoverlapping pairs of axes need be 
plotted to begin with, that is, factors 1 and 2, 3 and 4, and so on. 
This should be done with great care, using a sharp pencil point. From 
initial plotting to the completion of the rotational process, the load- 
ings are projected graphically from one pair of sheets to the next 
sheet. Only at the end are the numerical projections read from the 
plots to form a table of rotated loadings. 

The plotting of centroid axes is simply accomplished by laying 
a sheet of paper on the board and locating the points by means of 
the grids on the underlying graph paper, which can be seen plainly. 
The two axes so plotted may be rotated by turning the sheet until 
the horizontal and vertical graph lines which intersect at the origin 
are located in the desired positions relative to the plot on the sheet. 
The next axes are then drawn using the T square and triangle. 

When a rotation is planned with two factors, each of which has 
already been plotted, e.g., 7;+ and IJI. in Figure 2, the procedure is as 
follows: 

(1) In the lower left-hand portion of the board adjust the sheet 
of paper on which the last rotated plot for one of the factors appeared. 
It does not matter, of course, what other factor is plotted on the same 


* This method of holding the papers in place was suggested by Sgt. J. Boory. 
+1, represents the fifth rotation of centroid axis /. 
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sheet. The sheet is so adjusted that the rotated axis of the factor is 
parallel to the vertical lines on the graph paper beneath. See factor 
III, , in Figure 2. 

(2) Then, in the upper right-hand portion of the board, adjust 
the sheet on which the last rotated plot for the second factor appears 
so that the axis of the factor is parallel to the horizontal lines in the 
graph paper. See Factor /; , in Figure 2. 

(3) Next adjust a new sheet so that all the test loadings pro- 
jected horizontally to the right from the lower plot, and projected 
vertically downward from the upper plot will fall on the paper. See 
Figure 2. 





FIG. 2 


(4) Using the T square and triangle, draw horizontal and ver- 
tical axis lines on the new sheet as continuations of the desired axes 
on the two previous plots. Draw these lines lightly to facilitate the 
perceptual discrimination between the plotted axes and the rotated 
axes which will later be drawn with heavier lines. 

(5) Now plot the position of each test on the new sheet. First 
adjust the hairline on the T square so that it passes through the point 
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representing the test number which is farthest upward on the lower 
plot, e.g., variable 9 in Figure 2. Then adjust the vertical hairline 
of the triangle so that it passes through the point representing the 
same number on the upper plot. 

(6) Insert a pencil in the hole in the triangle and mark the 
position of the test on the new plot. Record the number of the test. 

(7) Move the T square hairline downward until it passes 
through the next test point on the lower plot, e.g., variable 1 in Fig- 
ure 2. 

(8) Then adjust the vertical hairline of the triangle to pass 
through the corresponding point on the upper plot. 

(9) Mark the position of this test on the new plot and record 


its number. 
(10) Continue in this manner until all test loadings have been 


projected to the new sheet. 

(11) When the rotation has been determined, draw the new 
axes with a heavier line than those for the original axes. 

(12) Label the new axes. On the old sheets indicate by a cross 
or a check mark the axes which have now been rotated. 

With the apparatus currently in use, the error introduced by dis- 
placing the pencil point from its geometrically accurate position is 
very small. A precision-made apparatus could be constructed to oper- 
ate within almost any limits desired. 

Errors in plotting are unlikely, for it is only necessary to be sure 
that the corresponding numbers are lined up and projected, and that 
the latest rotations of each factor are used for each new plot. Ac- 
curacy depends upon the care with which plots are lined on the board 
and the hairlines adjusted to the test points. 
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A NEW DESCRIPTIVE STATISTIC: THE PARABOLIC 
CORRELATION COEFFICIENT 


CHARLES C. PETERS 
PENNSYLVANIA STATE COLLEGE 


This article proposes a new descriptive statistic related to the 
second order parabola in the same manner in which the familiar 
correlation coefficient is related to the regression coefficient. The 
parabolic r describes in standard terms simultaneously the general 
trend of the regression and the extent and nature of its curvilinear- 
ity, and is relatively easy to compute and easy to communicate. For- 
mulas for its standard error and its limits are derived and it is 
applied to a number of regressions. 


It is a well-known fact that the correlation coefficient expresses 
the slope of a straight regression line on a squeezed scale, and that 
it is an inappropriate and misleading descriptive statistic where the 
relation between two sets of paired variates is not rectilinear. Never- 
theless it has been recklessly used without regard to the rectilinearity 
of the regression. That has been in part due to the absence of any 
convenient descriptive statistic for curvilinear correlational relation- 
ships. We have long had, of course, the equations of parabolas of 
various orders and also a number of other forms of curves. But these 
have not been in the form of standard units so as to make their de- 
scriptions of the regressions readily communicable. It is the purpose 
of this paper to describe a new statistic related to the equation of the 
second-order parabola in the same manner as the correlation coef- 
ficient is related to the regression coefficient, to give machinery for 
translating it into standard meaning, and to develop for it a standard 
error formula and a proof of its limits. I have named this statistic 
the parabolic correlation coefficient. It provides a dual description 
of the regression: (1) an index of the slope (i.s.) exactly equal in 
numerical value to the correlation coefficient; and (2) an index of 
corvilinearity (i.c.) The index of slope is the first derivative of the 
equation of the second-order parabola evaluated at a certain point to 
be described in this article, while the index of curvilinearity is one 
half of the second derivative, each adjusted to a standard meaning by 
a suitable squeezing of the scale. 

The equation for the second degree parabola is 


Y=a + bX + cX?. (1) 
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We wish to get values for the a, b, and c such that the sum of the 
squares of the errors made in predicting Y-scores with this equation 
will be a minimum. In the case of any individual with an X-score in 
the class i the error would be 


(Y:— Yi) = Y;— (a + bX; + eX). 


We want the sum of the squares of these errors to be a minimum. 
Hence we square the expression at the right and sum it termwise. 
But we shall write the terms in a different order, merely for con- 
venience and, for the sake of simplicity, shall drop the subscripts. 
The sum of the squares of the errors is: 


SY? + AeDX* + B*DX2 + Na? — 2c X?Y — 2OSXY 
— 2adY + 2acdX? + 2abSX + 2beDdX°. 


In order to determine values for a, b, and c that will make this ex- 
pression for the sum of the squares of the errors a minimum, we must 
equate to zero the partial derivatives with respect to these three 
terms. These partial derivatives are in order (each equated to zero 
and each divided through by 2): 


cDX* + bX? + aD X?— DTVX*Y=0, 
eS X? + bX? + aX —SXY —0O, 
cSX?+b5X +Na —SY =0. 


This set of simultaneous equations must be solved for the three 
unknowns, a, b, and c. A number of different outcomes, all completely 
equivalent in value, are possible according to the method of handling 
the algebra. One of the most convenient solutions is the following: 
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These are the coefficients which are now to be put in equation (1) 
for predicting Y scores. 

But so far we have an operative tool, not a descriptive statistic. 
We shall turn next to getting the latter. The reader should get in 
mind a conventional correlation chart laying out the independent 
scores on the X-axis and the dependent scores on the Y-axis. We are 
concerned with two features about such outlay: (1) What is the gen- 
eral trend of the Y scores as we go up along the X-axis? and (2) To 
what extent is the regression line curved rather than straight, and in 
what direction is it curved? We can get an account of the first fea- 
ture from the first derivative of the equation of the parabola and an 
account of the second feature from the second derivative. 

The first derivative of equation (1) is b + 2cX. This gives the 
slope of a tangent to the curve at any point at which the derivative 
is evaluated. Its value is dependent upon its place along the X-axis, 
as is evidenced by the fact that it contains an X. If we evaluate it 
near the middle of the X-distribution, we get a tangent that will show 
the genera] trend pretty well for the whole distribution; and if, in 
addition, we make a proper adjustment in our scales, we can have a 
description of the general trend that compares in value with the co- 
efficient of correlation. We shall adjust our scales by stretching or 
squeezing them so as to make the units equal in terms of variability 
on the two axes. This we accomplish by multiplying any increments 
along the X-axis (here the first derivative) by o:/c,. But we shall 
put this in summation form in the first term since we have all the 
other terms in summation form, but shall, for convenience in writing, 
let it stand in o form in the other two terms. So re-writing our first 
derivative thus multiplied by the standard deviation ratio and sepa- 
rating it into three terms, and at the same time putting the 2c over 
the common denominator, and also substituting NM, for >X , we have 








— NSXY —SX-SY a c(N>X? — NM,>X?*) 
NS? — [SXT)) (NSY?— PSYT) Non 
, 2cXN? o,? vd 
iste 


Inspection of the above expression will show that, if we evaluate it 
where the second and third terms aggregate exactly zero, we shall 
have left precisely the formula for the Pearsonian correlation co- 
efficient. Since the denominators are the same, that will be where 
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2cXN? o,? = c(N>X* — NM;>X?). 
Solving this for X , we get : 
os 2 
x= DX? — MX (6) 
2No,? 
So if we evaluate the first derivative at the position on the X-axis 
where X has the value indicated in (6) on our adjusted scale, we 
shall always get for this part of our descriptive statistic exactly the 
numerical value of the coefficient of correlation. Investigation of this 
expression will show that it reduces to the mean in the case of a sym- 
metrical distribution of the X-variable; but we can easily get for the 
needed critical point at which to evaluate it a more generalized value. 
The expression as it stands in (6) is in score form. In order to 
investigate its properties we shall transform it to deviation form. 
The capital X represents a score and the lower case x a deviation 
from the mean of the X’s. So 


X=—x+M,. 





Hence 
SX3 = Sz? + 8M, 2? + 8M? 2 + NM; 
7 
M,>X? = M,>\x? + 2M,?32 +NM-. 7) 
Subtracting the second equation in (7) from the first to get the num- 
erator of (5), remembering that no matter what the shape of the dis- 
tribution Sz = 0, and simplifying, we have 
2M + Fe a)? 
= = = = M, =i 4 Or (2 . 
2No,2 N?o° 
The term under the radical above is, in the Pearson terminology, f, . 
But because the sum of the odd powers may have either the plus or 
minus sign before squaring, and because we wish to preserve the ef- 
fect of the sign, we shall write the equation as follows: 


X=M,+ so.VB:. (9) 


For a normal distribution, or for any svmmetrical distribution, 
8, is zero. So, in order to get for this part of our descriptive statistic 
a value exactly equal to the coefficient of correlation, we must evalu- 
ate it exactly at the mean if the X-distribution is normal or other- 
wise symmetrical. If the distribution is not symmetrical, the critical 
point would be near the mean but a little off to the extent involved 
in (9). But in the computational work the worker need not be con- 
cerned about that. If he will merely compute 





(8) 
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it will follow as an automatic consequence that he has evaluated the 
derivative at the position indicated in (9) and that he has the slope 
of a tangent to the curve exactly parallel to the fitted line the slope 
of which would be the coefficient of correlation. It would, of course, 
be plausible always to evaluate the derivative at the mean, but then 
our value for the 7. s. would not always have a familiar meaning. It 
would seem worth while to sacrifice that point for the practical ad- 
vantage of continuing to carry in our parabolic correlation coefficient 
an element exactly equal to the well-known Pearsonian 7 . 

The above discussion provided an index of slope for our descrip- 
tive statistic. We need for our second element an index of the curvi- 
linearity. This requires the second derivative. The second derivative 
of equation (1) is 2c: But for certain practical reasons discussed be- 
low we shall take for our 7.c. half of that derivative, namely c. But 
we also want it expressed in terms of our adjusted scales in order 
that it may have a standard meaning for all types of distributions 
to which it is applied. That means that we must multiply it by the 
ratio «,2/c,. When this multiplier is put in summation terms and 
used with equation (2) we get for the i.c. equation (10a) below. When 
used with another of the three possible solutions for c, we get equa- 
tion (10b) below. In practice it is desirable to employ both of these 
formulas as an arithmetic check. They should give identical outcomes. 
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The reader may be alarmed at the elaborateness of this formula 
for the index of curvilinearity. Its application does, it must be ad- 
mitted, require some work. But it will be observed that the same 
quantities recur in it, in consequence of which the labor is less than 
would at first appear. The quantities in all of the parentheses lend 
themselves to very economical procedures on a calculating machine. 
There are needed, besides the N , the following moments and product 
moments: 


=x 
yas 
aX? 
2Y? 
DxXY 
_s 
D>X 
=xX* 


The first five of these are needed in the computation of a coefficient 
of correlation; only the last three are additional. They can be very 
easily and quickly obtained when working with a correlation chart, 
though no correlation chart is really needed for this statistic as it is 
for the correlation ratio and the Blakeman Test. Getting them on a 
calculating machine requires going through the scores one additional 
time as compared with the correlation coefficient, and the total time 
to calculate a parabolic 7 from raw scores on a calculating machine 
is about twice that required for a Pearsonian coefficient of correla- 
tion!* The needed moments and product moments can also be ob- 
tained on the I.B.M. machines, but this requires that the X? values be 
punched in the cards as well as the X values. 


So our parabolic correlation coefficient will contain always two 
terms, one giving in familiar form an index of the general slope and 
the other giving an index of the form and the degree of curvilinear- 
ity. If the regression is really rectilinear, the second element will be 
zero and the parabolic correlation coefficient will automatically reduce 
to the Pearsonian r. In any case the parabolic correlation coefficient 
will automatically carry notice of the degree of departure from recti- 
linearity, and we can, if we wish, test that departure for statistical 
significance by the method explained below. A parabolic 7 of, say, 
—.163 — .167 means that there is in our relation a moderate over-all 
downward trend (i.s. = —.163) and there is an upward hump of mod- 
erate amount in the curve (i.c. = —.167). A parabolic r of —.163 + 

* Convenient worksheets for the parabolic correlation coefficient may be ob- 


tained from the Department of Psychology and Education of the Pennsylvania 
State College at one cent each. 











CHARLES C. PETERS 65 


.675 would mean that there is a general downward trend and also a 
distinct downward hump. Thus the nature of the regression can be 
visualized from the statement of the parabolic +r without the need 
for a correlation chart. 

What are the maximum and minimum limits between which our 
index of curvilinearity lies? My proof of these limits rests upon two 
postulates: the i.c. can be a maximum or a minimum only (1) when 
the index of slope is zero, and (2) when the regression law operates 
perfectly to place each Y-value exactly on the curve when allocated 
to its appropriate position along the X-axis. Hence, where a, is the 


standard deviation of the actual Y-values and os is the standard devia- 
tion of the Y’s on the regression line, A 


(a + 0X + cX?*)? + 
v= o=| : —M,*| 
y N 
When this expression is evaluated it results in moments in terms of 
DSX*, SX? , SX*, and SX. These are in score form; they can be more 
easily simplified if transformed to deviation form. Let X = x + M,, 
where the lower case x is a deviation from the mean of the X’s. Rais- 
ing this to the 4th power, summing, and dividing by the N of the de- 
nominator above yields terms in Sa*/N , Sa°/N , Sx?/N, and >2/N. 
The last two are, of course, o,? and zero. The first is f.0;* and the 
second is o,°\/f,;. The X* can be correspondingly treated. When 
these values are substituted, and also values for a and for b from 
equations (3) and (4), and cognizance is taken of the fact that the 
7.8. must be zero, the whole expression reduces to 
oy — Co? V bo — Bi — 1. 

The algebra involved in this is straightforward substitution but to 
carry it through requires several pages, which I do not deem it nec- 
essary to consume here, because anyone who is interested can carry 
through the proof himself from the above hints. We now substitute 
this value for the denominator in formula (10). The o,? is canceled 
by the numerator of the same fraction. The ¢ standing in the sub- 
stituted denominator here is canceled by the lengthy fraction con- 
stituting the left-hand factor of equation (10), which is exactly c. 
Hence the whole reduces to the very simple form 


1 
VBb:— Bi—-1 
The minimum value is had by using the negative sign with the square 


root. The f’s here are the Pearsonian measures of kurtosis and of 
skewness, respectively, of the X-variable (the independent variable). 








4.¢.™8%- = 
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For a normal distribution (§. = 3 and £, = 0) the maximum limit 
of the ¢.c. would be .707. .. . But for a rather platykurtic distribution 
the i.c. could reach +1.00 or even, though rarely in practice, exceed 
those limits. It is because of the desirability of pairing two elements 
in our parabolic 7 that are confined to substantially the same limits 
that we chose half of the second derivative for our i.c. instead of the 
second derivative as it stood. It is, perhaps, a further justification 
that the i.c. then becomes just the c of the parabolic equation, though 
on a squeezed scale. 

Statistical workers accustomed to the conventional testing for 
reliability will ask for standard error formulas for this statistic. Since 
the parabolic correlation coefficient is merely a descriptive statistic 
and has no functional meaning as a whole, we do not need a standard 
error formula for it as a whole, though one could be developed if 
wanted. We are interested only in testing the reliability of its two 
parts. For testing the hypothesis of a true slope of zero, the standard 
error of the index of slope where we have evaluated it is merely that 
long familiar for testing the corresponding hypothesis in the correla- 


tion coefficient: 
i: 


VN—1 
But we desire a formula for the standard error of the index of curvi- 


linearity. Deming* has shown that the sampling variance of the co- 
efficient of X? in a second-order parabola, c, is 


S.E.i.s. — 





"Pe 


S£.2= , (11) 
(N — 3) (S2*/N — o*) 
where o,? is the variance of the Y-array in the sample and the z’s in 
the denominator are deviations from the X-mean. But that form 
would be too inconvenient in use. We want a form that can be easily 
applied mentally as soon as we hear the index of curvilinearity and 
know the N. We can simplify (11) by assuming normality of distri- 
bution in the X-variable. Normality is not a very violent assump- 
tion in the X-variable, although it would be for the Y-variable if there 
is marked curvilinearity. 
>z*/No-* is f. in the Pearson terminology. So Sa*/N = f. o:*. 
But in a normal distribution, 8, = 3. So the first term in the paren- 
theses in (11) equals 3c*. This combines with the second member to 
make 2c*. Making this substitution and taking the square root, we 
have for the standard error of the coefficient c , 





* Deming, W. E. Statistical adjustment of data. John Wiley and Sons, 1943. 
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S.E..= = (12) 
a? V2(N— 38) 





But our index of curvilinearity is be c. On the principle that o,..= de, 
Cy 
we would then have for the standard error of our index of curvi- 
linearity: 
oz" Oy 1 
S.E.i.c.=—: — ‘. (13) 
oy of V2(N—3) V2(N—8) 


In mathematical theory this has the same meaning and use as 
the standard error of any other statistic, and is to be divided into the 
c to get the standard error ratio. But in this sort of situation, as in 
many others, the worker should be warned against the customary ac- 
ceptance of the hypothesis that there may be no true curvilinearity 
until the standard error ratio reaches the five per cent or the one per 
cent level. A test of statistical significance is properly employed to cau- 
tion the worker against proceeding to action on the basis of his find- 
ings when there still remains considerable possibility that the differ- 
ence tested may have arisen by chance fluctuation in sampling. In an 
experiment on methods of teaching, for example, one should withhold 
confident action even when the odds are 90 to 10 that the true differ- 
ence is on the side on which he found it, for they are still 10 to 90 that 
the true difference might turn out to be on the other side. But in re- 
spect to taking warning that there may be true curvilinearity in our 
regression the danger of over-hasty action lies on the opposite side; 
here the unwarrantedly hasty action would consist in disregarding 
the evidence with low probability that there may be true curvilinear- 
ity and proceeding to treat the regression as rectilinear. So, whereas 
we hold forth for a standard error ratio of two or two and a half 
before we claim statistical significance in an experiment, we would 
better take warning from a standard error ratio of one in this con- 
nection (as, indeed, in a number of other situations of like charac- 
ter). The actual numerical size of the i.c. is more important that its 
standard error ratio. An i.c. of .06 or .10 or higher arithmetically 
shows an appreciable amount of curvilinearity. 

In the table below are reported a number of parabolic r’s . These 
are not a random sample nor are they selected as only the ones show- 
ing curvilinearity; they are ones computed from data in which it 
seemed plausible that we might find curvilinearity. In order to ap- 
preciate the phenomena involved in them, it may be well to ap- 
proach the table by speculating about what might be expected to be 
the relation. A number of the parabolic 7’s are ones computed be- 
tween scores in subject fields made at a single long examination by 
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Parabolic Correlation Coefficients Between Indicated Pairs of Variates 





Variates 1.8. ic. S.E.;.. Ratio 
BAIBUOTY ON MABUNOMALICS  —.——a..-....neensnisnecnccsenesses + 08 —.168 .067 2.49 
Percentage of votes for Willkie by counties in 

Pa. on density of population ........................ —.291 +.208 .089 2.34 


Percentage of votes for Willkie in 1940 by 
Counties in Ohio on density of population —.402 +.011 077 14 


Otis Classification test on age at eighth- 


Oy Ld Lo ae cea nei tp oe Renae —.163 —.167 .045 3.71 
Literature on grade-point average ....... ...0........-- +.347 —.138 .067 2.04 
Literature on mathematics ......................cs0:e00000-+- +.113 —.125 .067 1.85 
Belief in superstitions on I1.Q. _...00..02...2.20.2---- —.199 —.054 021 2.57 
Belief in superstitions on no. of magazines 

EEE DNS Sa —.145 —.087 .021 1.76 
Belief in superstitions on no. of semesters taken 

“RSC EES Re ee ee —.101 +.033 021 1.50 
Impulsiveness on success in nursing ..................-. —.265 —.125 .072 1.74 
Indecision on success in nursing ......................---- —.140 —.054 .072 75 
Worry on success in nursing ...................20--0200-+-- —.027 —.041 .072 57 
ee | i ra ra +.408 —.111 .067 1.64 
Grade-point average on math. ....000.0222000020eeeee +.321 —.111 .067 1.64 
Mathematics on literature .........0.00.0.....cceeseeeeeeeee +.113 —.109 .067 1.61 
English on grade-point average ...............----...------ +.528 —.081 .067 1.17 
Washburne control scores on academic achieve- 

ORR 2 Se De ee a ee —.084 +.093 .041 2.27 
Washburne alienation scores on academic 

ESS 2 ES SSRs em +126 +.052 041 1.27 
Happiness scores on academic achievement ........ +.042 +.053 .041 1.29 
Number of siblings on academic achievement 

ge ES ER IEEE At Pater SOLE Fee Pee ee ats aU —.091 —.006 041 15 
Science on general intelligercce -........................-+- +.306 +.068 .067 1.01 
Mmeish On MAtREMACS ............-..<. 22-206 0-sce.-noeae-ac--- +.162 —.062 .067 92 
ene coor ek rn See +.463 —.062 .067 .90 
Peeters ON MIGUOTY. .........--— -<-2c0ssibcsesnnecncoencccee +.082 +.022 .067 32 
Moore-Nell academic achievement test on be- 

Derupe seminary. foo) ae +.0383 —.010 .051 19 
English total on intelligence -.....................:-.:-00-+ +.728 +.022 .067 33 





113 sophomores at the Pennsylvania State College on the Carnegie 
Foundation Achievement Tests. As an individual increases his abil- 
ity in mathematics, what is likely to happen in his measured abilities 
in history? The two abilities may be expected to increase together 
at the low end on account of the common elements of reading ability, 
general intelligence, etc. But in the later stages those persons who 
go in much for mathematics are likely not to be going in as much for 
history, and hence at the upper levels the curve relating the two could 
be expected to bend downward from the straight regression line that 
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would have fitted the early stages. Thus there would be a humped 
up curve for the regression line, represented by a negative index of 
curvilinearity. That is exactly what inspection of the first line in the 
table shows to be the fact. Nearly all the other parabolic r’s bear out 
similarly the hypotheses one would set up for them as plausible. In 
the table the first term in each pair denotes the dependent variable 
and the second term the independent variable. Thus history on mathe- 
matics shows the manner in which the history scores regress as one 
goes up the scale of mathematics scores. 

We have talked in terms of a second-order parabola. Parabolas 
of higher order could, of course, also be used and the descriptive sta- 
tistic would need to have as many terms as the order of the parabola. 
But beyond the second order the computational work would greatly 
increase and the difficulty of communicating the results and of visual- 
izing their meaning would likewise increase. It would also be possible 
to set up descriptive statistics in terms of other curves, such as the 
Gompertz or other exponential curves. But they also would be more 
awkward to compute and to communicate. Moreover, the second-order 
parabola will serve pretty well to detect and describe curvilinearity 
through the ranges through which we usually measure. We are likely 
to be measuring growth some time after its initial period, when it is 
in the upper bend of the S of the Gompertz curve; and in relations 
which are fitted by curves of the form Y = e”, we are also likely to 
be working within a range in which the second-order parabola will 
not fit too badly. But for actual prediction rather than rough descrip- 
tion we should resort to the fitting of the needed type of curve. It 
would thus appear that the constants of a second-order parabola will 
make at least a satisfactory start in a more adequate description of 
regression than the sole resort to correlation coefficients upon which 
we have hitherto relied. Although our experience suggests that a dis- 
turbing amount of curvilinearity in regression is far from the rule, 
it does occur sufficiently frequently that the careful researcher will 
wish to be on the lookout for it. It is particularly likely to be missed 
when correlation coefficients are run on a calculating machine, or on 
the I.B.M. machines from ungrouped scores, where, consequently, 
there is no inspectional warning of the curvilinearity. 

















